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(54) Title: COMPOSITIONS AND METHODS FOR THE THERAPY AND DIAGNOSIS OF LUNG CANCER 

£3 (57) Abstract: Compositions and methods for the therapy and diagnosis of cancer, particularly lung cancer, are disclosed. Illustra- 
tive compositions comprise one or more lung tumor polypeptides, immunogenic portions thereof, polynucleotides that encode such 
© polypeptides, antigen presenting cell that expresses such polypeptides, and T cells that are specific for cells expressing such polypep- 
^ tides. The disclosed compositions are useful, for example, in the diagnosis, prevention and/or treatment of diseases, particularly lung 
^ cancer. 
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COMPOSITIONS AND METHODS FOR THE THERAPY AND 
DIAGNOSIS OF LUNG CANCER 

TECHNICAL FIELD OF THE INVENTION 

The present invention relates generally to therapy and diagnosis of 
5 cancer, such as lung cancer. The invention is more specifically related to polypeptides, 
comprising at least a portion of a lung tumor protein, and to polynucleotides encoding 
such polypeptides. Such polypeptides and polynucleotides are useftd in pharmaceutical 
compositions, e.g., vaccines, and other compositions for the diagnosis and treatment of 
lung cancer, 

10 BACKGROUND OF THE INVENTION 

Field of the Invention 

Cancer is a significant health problem throughout the world. Although 

advances have been made in detection and therapy of caricer, no vaccine or other 

universally successful method for prevention and/or treatment is currently available. 
15 Current therapies, which are generally based on a combination of chemotherapy or 

surgery and radiation, continue to prove inadequate in many patients. 

Description of Related Art 

Lung cancer is the primary cause of cancer death among both men and 

women in the U.S., with an estimated 172,000 new cases being reported in 1994. The 
20 five-year survival rate among all lung cancer patients, regardless of the stage of disease 

at diagnosis, is only 13%. This contrasts with a five-year survival rate of 46% among 

cases detected while the disease is still localized. However, only 16% of lung cancers 

are discovered before the disease has spread. 

In spite of considerable research into therapies for these and other 
25 cancers, lung cancer remains difficult to diagnose and treat effectively. Accordingly, 

there is a need in the art for improved methods for detecting and treating such cancers. 

The present invention fulfills these needs and further provides other related advantages. 
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SUMMARY OF THE INVENTION 

In one aspect, the present invention provides polynucleotide 
compositions comprising a sequence selected from the group consisting of: 

(a) sequences provided in SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 
5 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 
111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 
167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 
220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 
434, 442, 447, 450 and 467; 

10 (b) complements of the sequences provided in SEQ ID NO:l-3, 6-8, 

10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 
84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207,209, 
210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 

15 420, 424, 428, 43 1, 434, 442, 447, 450 and 467; 

(c) sequences consisting of at least 1 0, 1 5, 20, 25, 30, 35, 40, 45, 50, 
75 and 100 contiguous residues of a sequence provided in SEQ ID NO:l-3, 6-8, 10-13, 
15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86- 
96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 

20 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 
214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 
428, 431, 434, 442, 447, 450 and 467; 

(d) sequences that hybridize to a sequence provided in SEQ ID 
NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 

25 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 
148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 
198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 
368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, under moderate or 
highly stringent conditions', 

30 (e) sequences having at least 75%, 80%, 85%, 90%, 95%, 96%, 

97%, 98% or 99% identity to a sequence of SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 
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32, 3449, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 
111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 
167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 
220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 
5 434, 442, 447, 450 and 467; and 

(f) degenerate variants of a sequence provided in SEQ ID NO: 1-3, 6- 
8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80- 
82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 

10 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 
420, 424, 428, 43 1, 434, 442, 447, 450 and 467. 

In one preferred embodiment, the polynucleotide compositions of the 
invention are expressed in at least about 20%, more preferably in at least about 30%, 
and most preferably in at least about 50% of lung tumors samples tested, at a level that 

15 is at least about 2-fold, preferably at least about 5-fold, and most preferably at least 
about 10-fold higher than that for normal tissues. 

The present invention, in another aspect, provides polypeptide 
compositions comprising an amino acid sequence that is encoded by a polynucleotide 
sequence described above. 

20 The present invention further provides polypeptide compositions 

comprising an amino acid sequence selected from the group consisting of sequences 
recited in SEQ ID NO:152, 155, 156, 165, 166, 169, 170, 172, 174, 176, 226-252, 338- 
344, 346, 350, 357, 361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 
441, 443, 446, 449, 451-466 and 468-469. 

25 In certain preferred embodiments, the polypeptides and/or 

polynucleotides of the present invention are immunogenic, ie., they are capable of 
eliciting an immune response, particularly a humoral and/or cellular immune response, 
as further described herein. 

The present invention further provides fragments, variants and/or 

30 derivatives of the disclosed polypeptide and/or polynucleotide sequences, wherein the 
fragments, variants and/or derivatives preferably have a level of immunogenic activity 
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of at least about 50%, preferably at least about 70% and more preferably at least about 
90% of the level of immunogenic activity of a polypeptide sequence set forth in SEQ ID 
NO:152, 155, 156, 165, 166, 169, 170, 172, 174, 176, 226-252, 338-344, 346, 350, 357, 
361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 449 and 
5 451-466, or a polypeptide sequence encoded by a polynucleotide sequence set forth in 
SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 
73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 
144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 
194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 
10 365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467. 

J 

The present invention further provides polynucleotides that encode a 
polypeptide described above, expression vectors comprising such polynucleotides and 
host cells transfonned or transfected with such expression vectors. 

Within other aspects, the present invention provides pharmaceutical 
15 compositions comprising a polypeptide or polynucleotide as described above and a 
physiologically acceptable carrier. 

Within a related aspect of the present invention, the pharmaceutical 
compositions, e.g., vaccine compositions, are provided for prophylactic or therapeutic 
applications. Such compositions generally comprise an immunogenic polypeptide or 
20 polynucleotide of the invention and an immunostimulant, such as an adjuvant 

The present invention further provides pharmaceutical compositions that 
comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to 
a polypeptide of the present invention, or a fragment thereof; and (b) a physiologically 
acceptable carrier. 

25 Within further aspects, the present invention provides pharmaceutical 

compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as 
described above and (b) a pharmaceutical^ acceptable carrier or excipient. Illustrative 
antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts 
and B cells. 
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Within related aspects, pharmaceutical compositions are provided that 
comprise: (a) an antigen presenting cell that expresses a polypeptide as described above 
and (b) an immunostimulant. 

The present invention further provides, in other aspects, fusion proteins 
5 that comprise at least one polypeptide as described above, as well as polynucleotides 
encoding such fusion proteins, typically in the form of pharmaceutical compositions, 
e.g., vaccine compositions, comprising a physiologically acceptable carrier and/or an 
immunostimulant. The fusions proteins may comprise multiple immunogenic 
polypeptides or portions/variants thereof, as described herein, and may further comprise 
10 one or more polypeptide segments for facilitating the expression, purification and/or 
immunogenicity of the polypeptide(s). 

Within further aspects, the present invention provides methods for 
stimulating an immune response in a patient, preferably a T cell response in a human 
patient, comprising administering a pharmaceutical composition described herein. The 
15 patient may be afflicted with lung cancer, in which case the methods provide treatment 
for the disease, or patient considered at risk for such a disease may be treated 
prophylactically. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
20 patient a pharmaceutical composition as recited above. The patient may be afflicted 
with lung cancer, in which case the methods provide treatment for the disease, or patient 
considered at risk for such a disease may be treated prophylactically. 

The present invention further provides, within other aspects, methods for 
removing tumor cells from a biological sample, comprising contacting a biological 
25 sample with T cells that specifically react with a polypeptide of the present invention, 
wherein the step of contacting is performed under conditions and for a time sufficient to 
permit the removal of cells expressing the protein from the sample. 

Within related aspects, methods are provided for inhibiting the 
development of a cancer in a patient, comprising administering to a patient a biological 
30 sample treated as described above. 
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Methods are further provided, within other aspects, for stimulating 
and/or expanding T cells specific for a polypeptide of the present invention, comprising 
contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a 
polynucleotide encoding such a polypeptide; and/or (iii) an antigen presenting cell that 
5 expresses such a polypeptide; under conditions and for a time sufficient to permit the 
stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells 
prepared as described above are also provided. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
1 0 patient an effective amount of a T cell population as described above. 

The present invention further provides methods for inhibiting the 
development of a cancer in a patient, comprising the steps of: (a) incubating CD4+ 
and/or CD8 + T cells isolated from a patient with one or more of: (i) a polypeptide 
comprising at least an immunogenic portion of polypeptide disclosed herein; (ii) a 
15 polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that 
expressed such a polypeptide; and (b) administering to the patient an effective amount 
of the proliferated T cells, and thereby inhibiting the development of a cancer in the 
patient Proliferated cells may, but need not, be cloned prior to administration to the 
patient 

20 Within further aspects, the present invention provides methods for 

determining the presence or absence of a cancer^ preferably a lung cancer, in a patient 
comprising: (a) contacting a biological sample obtained from a patient with a binding 
agent that binds to a polypeptide as recited above; (b) detecting in the sample an amount 
of polypeptide that binds to the binding agent; and (c) comparing the amount of 

25 polypeptide with a predetermined cut-off value, and therefrom determining the presence 
or absence of a cancer in the patient. Within preferred embodiments, the binding agent 
is an antibody, more preferably a monoclonal antibody. 

The present invention also provides, within other aspects, methods for 
monitoring the progression of a cancer in a patient Such methods comprise the steps 

30 of: (a) contacting a biological sample obtained from a patient at a first point in time 
with a binding agent that binds to a polypeptide as recited above; (b) detecting in the 
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sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a) 
and (b) using a biological sample obtained from the patient at a subsequent point in 
time; and (d) comparing the amount of polypeptide detected in step (c) with the amount 
detected in step (b) and therefrom monitoring the progression of the cancer in the 
5 patient. 

The present invention further provides, within other aspects, methods for 
determining the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample obtained from a patient with an oligonucleotide that 
hybridizes to a polynucleotide that encodes a polypeptide of the present invention; (b) 

1 0 detecting in the sample a level of a polynucleotide, preferably mRNA, that hybridizes to 
the oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the 
oligonucleotide with a predetermined cut-off Value, and therefrom determining the 
presence or absence of a cancer in the patient Within certain embodiments, the amount 
of mRNA is detected via polymerase chain reaction using, for example, at least one 

15 oligonucleotide primer that hybridizes to a polynucleotide encoding a polypeptide as 
recited above, or a complement of such a polynucleotide. Within other embodiments, 
the amount of mRNA is detected using a hybridization technique, employing an 
oligonucleotide probe that hybridizes to a polynucleotide that encodes a polypeptide as 
recited above, or a complement of such a polynucleotide. 

20 In related aspects, methods are provided for monitoring the progression 

of a cancer in a patient, comprising the steps of: (a) contacting a biological sample 
obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that 
encodes a polypeptide of the present invention; (b) detecting in the sample an amount of 
a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) 

25 using a biological sample obtained from the patient at a subsequent point in time; and 
(d) comparing the amount of polynucleotide detected in step (c) with the amount 
detected in step (b) and therefrom monitoring the progression of the cancer in the 
patient. 

Within further aspects, the present invention provides antibodies, such as 
30 monoclonal antibodies, that bind to a polypeptide as described above, as well as 
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diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more 
oligonucleotide probes or primers as described above are also provided. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description. All references disclosed herein are 
5 hereby incorporated by reference in their entirety as if each was incorporated 
individually. 

SEQUENCE IDENTIFIERS 

SEQ ID NO:l is the determined cDNA sequence for LST-S1-2 
SEQ ID NO:2 is the determined cDNA sequence for LST-S1-28 

10 SEQ ID NO:3 is the determined cDNA sequence for LST-S1-90 
SEQ ID NO:4 is the determined cDNA sequence for LST-S1-144 
SEQ ID NO:5 is the determined. cDNA sequence for LST-S1-1 33 
SEQ ID NO:6 is the determined cDNA sequence for LST-S1-169 
SEQ ID NO:7 is the determined cDNA sequence for LST-S2-6 

15 SEQ ID NO:8 is the determined cDNA sequence for LST-S2-1 1 
SEQ ID NO:9 is the determined cDNA sequence for LST-S2-17 
SEQ ID NO: 10 is the determined cDNA sequence for LST-S2-25 
SEQ ID NO:l 1 is the determined cDNA sequence for LST-S2-39 
SEQ ID NO: 12 is a first determined cDNA sequence for LST-S2-43 

20 SEQ ID NO: 13 is a second determined cDNA sequence for LST-S2-43 
SEQ ID NO: 14 is the determined cDNA sequence for LST-S2-65 
SEQ ID NO: 15 is the determined cDNA sequence for LST-S2-68 
SEQ ID NO: 16 is the determined cDNA sequence for LST-S2-72 
SEQ ID NO: 17 is the determined cDNA sequence for LST-S2-74 

25 SEQ ID NO:18 is the determined cDNA sequence for LST-S2-103 
SEQ ID NO:19 is the determined cDNA sequence for LST-S2-N1-1F 
SEQ ID NO:20 is the determined cDNA sequence for LST-S2-N1-2A 
SEQ ID NO:21 is the determined cDNA sequence for LST-S2-N1-4H 
SEQ ID NO:22 is the determined cDNA sequence for LST-S2-N1-5A 

30 SEQ ID NO:23 is the determined cDNA sequence for LST-S2-N1-6B 
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SEQ ID NO:24 is the determined cDNA sequence for LST-S2-N1-7B 
SEQ ID NO:25 is the determined cDNA sequence for LST-S2-N1-7H 
SEQ ID NO:26 is the determined cDNA sequence for LST-S2-N1-8A 
SEQ ID NO:27 is the determined cDNA sequence for LST-S2-N1-8D 
5 SEQ ID NO:28 is the determined cDNA sequence for LST-S2-N1 -9A 
SEQ ID NO:29 is the determined cDNA sequence for LST-S2-N1-9E 
SEQ ID NO:30 is the determined cDNA sequence for LST-S2-N1-10A 
SEQ ID NO:31 is the determined cDNA sequence for LST-S2-N1-10G 
SEQ ID NO:32 is the determined cDNA sequence for LST-S2-N1-1 1 A 

10 SEQ ID NO:33 is the determined cDNA sequence for LST-S2-N1-12C 
SEQ ID NO:34 is the detennined cDNA sequence for LST-S2-N1-12E 
SEQ ID NO:35 is the determined cDNA sequence for LST-S2-B1-3D 
SEQ IDNO:36 is the determined cDNA sequence for LST-S2-B1-6C 
SEQ ID NO:37 is the determined cDNA sequence for LST-S2-B1-5D 

15 SEQ ID NO:38 is the determined cDNA sequence for LST-S2-B1-5F 
SEQ IDNO:39 is the detennined cDNA sequence for LST-S2-B1-6G 
SEQ ID NO:40 is the determined cDNA sequence for LST-S2-B1 -8A 
SEQ ID NO:41 is the determined cDNA sequence for LST-S2-B1-8D 
SEQ ID NO:42 is the determined cDNA sequence for LST-S2-B1-10A 

20 SEQ ID NO:43 is the detennined cDNA sequence for LST-S2-B 1 -9B 
SEQ ID NO:44 is the determined cDNA sequence for LST-S2-B1-9F 
SEQ ID NO:45 is the determined cDNA sequence for LST-S2-B1-12D 
SEQ ID NO:46 is the determined cDNA sequence for LST-S2-I2-2B 
SEQ ID NO:47 is the determined cDNA sequence for LST-S2-I2-5F 

25 SEQ ID NO:48 is the detennined cDNA sequence for LST-S2-I2-6B 
SEQ ID NO:49 is the determined cDNA sequence for LST-S2-I2-7F 
SEQ ID NO:50 is the determined cDNA sequence for LST-S2-I2-8G 
SEQ ID NO:51 is the determined cDNA sequence for LST-S2-I2-9E 
SEQ ID NO:52 is the determined cDNA sequence for LST-S2-I2-12B 

30 SEQ ID NO:53 is the determined cDNA sequence for LST-S2-H2-2C 
SEQ ID NO:54 is the determined cDNA sequence for LST-S2-H2-1G 
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SEQ ID NO:55 is the detennined cDNA sequence for LST-S2-H2-4G 
SEQ ID NO:56 is the determined cDNA sequence for LST-S2-H2-3H 
SEQ ID NO:57 is the determined cDNA sequence for LST-S2-H2-5G 
SEQ ID NO:58 is the determined cDNA sequence for LST-S2-H2-9B 
5 SEQ ID NO:59 is the determined cDNA sequence for LST-S2-H2-10H 
SEQ ID NO:60 is the determined cDNA sequence for LST-S2-H2-12D 
SEQ ID NO: 61 is the determined cDNA sequence for LST-S3-2 
SEQ ID NO: 62 is the determined cDNA sequence for LST-S3-4 
SEQ ID NO: 63 is the determined cDNA sequence for LST-S3-7 

10 SEQ ID NO: 64 is the determined cDNA sequence for LST-S3-8 
SEQ ID NO: 65 is the detennined cDNA sequence for LST-S3-12 
SEQ ID NO: 66 is the determined cDNA sequence for LST-S3-13 
SEQ ID NO: 67 is the determined cDNA sequence for LST-S3-14 
SEQ ID NO: 68 is the determined cDNA sequence for LST-S3-16 

15 SEQ ID NO: 69 is the determined cDNA sequence for LST-S3-21 
SEQ ID NO: 70 is the determined cDNA sequence for LST-S3-22 
SEQ ID NO: 71 is the determined cDNA sequence for LST-S1-7 
SEQ ID NO: 72 is the determined cDNA sequence for LST-S1-A-1E 
SEQ ID NO: 73 is the determined cDNA sequence for LST-S1-A-1G 

20 SEQ ID NO: 74 is the determined cDNA sequence for LST-S1-A-3E 
SEQ ID NO: 75 is the determined cDNA sequence for LST-S1-A-4E 
SEQ ID NO: 76 is the detennined cDNA sequence for LST-S1-A-6D 
SEQ ID NO: 77 is the determined cDNA sequence for LST-S1-A-8D 
SEQ ID NO: 78 is the determined cDNA sequence for LST-S1-A-10A 

25 SEQ ID NO: 79 is the determined cDNA sequence for LST-S1-A-10C 
SEQ ID NO: 80 is the determined cDNA sequence for LST-S1-A-9D 
SEQ ID NO: 81 is the determined cDNA sequence for LST-S1-A-10D 
SEQ ID NO: 82 is the determined cDNA sequence for LST-S1-A-9H 
SEQ ID NO: 83 is the determined cDNA sequence for LST-S1-A-1 ID 

30 SEQ ID NO: 84 is the determined cDNA sequence for LST-S1-A-12D 
SEQ ID NO: 85 is the determined cDNA sequence for LST-S1-A-1 IE 
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SEQ ID NO: 86 is the determined cDNA sequence for LST-S1-A-12E 
SEQ ID NO: 87 is the determined cDNA sequence for L5 13S (T3). 
SEQ ID NO: 88 is the determined cDNA sequence for L513S contig 1. 
SEQ ID NO: 89 is a first determined cDNA sequence for L514S. 
5 SEQ ID NO: 90 is a second determined cDNA sequence for L5 14S. 
SEQ ID NO: 91 is a first determined cDNA sequence for L516S. 
SEQ ID NO: 92 is a second determined cDNA sequence for L516S. 
SEQ ID NO: 93 is the determined cDNA sequence for L517S. 

SEQ ID NO: 94 is the extended cDNA sequence for LST-S1-169 (also known as 
10 L519S). 

SEQ ID NO: 95 is a first determined cDNA sequence for L520S. 

SEQ ID NO: 96 is a second determined cDNA sequence for L520S. 

SEQ ID NO: 97 is a first determined cDNA sequence for L521S. 

SEQ ID NO: 98 is a second determined cDNA sequence for L521S. 
1 5 SEQ ID NO: 99 is the determined cDNA sequence for L522S. 

SEQ ED NO: 100 is the determined cDNA sequence for L523S. 

SEQ ID NO: 101 is the determined cDNA sequence for L524S. 

SEQ ID NO: 102 is the determined cDNA sequence for L525S. 

SEQ ID NO: 103 is the determined cDNA sequence for L526S. 
20 SEQ ID NO: 104 is the determined cDNA sequence for L527S. 

SEQ ID NO: 105 is the determined cDNA sequence for L528S. 

SEQ ID NO: 106 is the determined cDNA sequence for L529S. 

SEQ ID NO: 107 is a first determined cDNA sequence for L530S. 

SEQ ID NO: 108 is a second determined cDNA sequence for L530S. 
25 SEQ ID NO: 1 09 is the determined full-length cDNA sequence for L53 1 S short form 

SEQ ID NO: 1 10 is the amino acid sequence encoded by SEQ ID NO: 109. 

SEQ ED NO: 1 1 1 is the determined fiill-length cDNA sequence for L53 IS long form 

SEQ ID NO: 1 12 is the amino acid sequence encoded by SEQ ID NO: 111. 

SEQ ID NO: 1 13 is the determined full-length cDNA sequence for L520S. 
30 SEQ ID NO: 1 14 is the amino acid sequence encoded by SEQ ID NO: 1 13. 

SEQ ID NO: 1 1 5 is the determined cDNA sequence for contig 1 . 
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SEQ ID NO: 1 16 is the determined cDNA sequence for contig 3. 
SEQ ID NO: 1 17 is the determined cDNA sequence for contig 4. 
SEQ ID NO: 1 18 is the determined cDNA sequence for contig 5. 
SEQ ID NO: 1 19 is the determined cDNA sequence for contig 7. 
5 SEQ ID NO: 120 is the determined cDNA sequence for contig 8. 
SEQ ID NO: 121 is the determined cDNA sequence for contig 9. 
SEQ ID NO: 122 is the determined cDNA sequence for contig 1 0. 
SEQ ID NO: 123 is the determined cDNA sequence for contig 12. 
SEQ ID NO: 124 is the determined cDNA sequence for contig 1 1 . 
10 SEQ ID NO: 125 is the determined cDNA sequence for contig 13 (also known as 
L761P). 

SEQ ID NO: 126 is the determined cDNA sequence for contig 15. 

SEQ ID NO: 127 is the determined cDNA sequence for contig 16. 

SEQ ID NO: 128 is the determined cDNA sequence for contig 1 7. 
15 SEQ ID NO: 129 is the determined cDNA sequence for contig 19. 

SEQ ID NO: 130 is the determined cDNA sequence for contig 20. 

SEQ ED NO: 131 is the determined cDNA sequence for contig 22. 

SEQ ID NO: 132 is the determined cDNA sequence for contig 24. 

SEQ ID NO: 133 is the determined cDNA sequence for contig 29. 
20 SEQ ID NO: 1 34 is the determined cDN A sequence for contig 3 1 . 

SEQ ID NO: 135 is the determined cDNA sequence for contig 33. 

SEQ ED NO: 136 is the determined cDNA sequence for contig 38. 

SEQ ID NO: 137 is the determined cDNA sequence for contig 39. 

SEQ ID NO: 138 is the determined cDNA sequence for contig 41. 
25 SEQ ID NO: 1 39 is the determined cDNA sequence for contig 43 . 

SEQ ID NO: 140 is the determined cDNA sequence for contig 44. 

SEQ ED NO: 141 is the determined cDNA sequence for contig 45. 

SEQ ID NO: 142 is the determined cDNA sequence for contig 47. 

SEQ ED NO:- 143 is the determined cDNA sequence for contig 48. 
30 SEQ ID NO: 144 is the determined cDNA sequence for contig 49. 

SEQ ID NO: 145 is the determined cDNA sequence for contig 50. 
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SEQ ID NO: 146 is the determined cDNA sequence for contig 53. 
SEQ ID NO: 147 is the determined cDNA sequence for contig 54. 
SEQ ID NO: 148 is the determined cDNA sequence for contig 56. 
' SEQ ID NO: 149 is the determined cDNA sequence for contig 57. 
5 SEQ ID NO: 1 50 is the determined cDNA sequence for contig 58. 
SEQ ID NO: 151 is the full-length cDNA sequence for L530S. 
SEQ ID NO: 152 is the amino acid sequence encoded by SEQ ID NO: 151 
SEQ ID NO: 153 is the ftdl-length cDNA sequence of a first variant of L514S 
SEQ ID NO: 154 is the full-length cDNA sequence of a second variant of L514S 
10 SEQ ID NO: 155 is the amino acid sequence encoded by SEQ ID NO: 153. 
SEQ ID NO: 156 is the amino acid sequence encoded by SEQ ID NO: 154. 
SEQ ID NO: 157 is the determined cDNA sequence for contig 59. 
SEQ ID NO: 158 is the full-length cDNA sequence for L763P (also referred to as 
contig 22). 

15 SEQ ID NO: 159 is the amino acid sequence encoded by SEQ ID NO: 158. 

SEQ ID NO: 160 is the full-length cDNA sequence for L762P (also referred to as contig 
17). 

SEQ ID NO: 161 is the amino acid sequence encoded by SEQ ID NO: 160. 

SEQ ID NO: 162 is the determined cDNA sequence for L515S. 
20 SEQ ID NO: 163 is the full-length cDNA sequence of a first variant of L524S. 

SEQ ID NO: 164 is the full-length cDNA sequence of a second variant of L524S. 

SEQ ID NO: 1 65 is the amino acid sequence encoded by SEQ ID NO: 1 63 . 

SEQ ID NO: 166 is the amino acid sequence encoded by SEQ ID NO: 164. 

SEQ ID NO: 167 is the full-length cDNA sequence of a first variant of L762P. 
25 SEQ ID NO: 1 68 is the full-length cDNA sequence of a second variant of L762P. 

SEQ ID NO: 169 is the amino acid sequence encoded by SEQ ID NO: 167. 

SEQ ID NO: 170 is the amino acid sequence encoded by SEQ ID NO: 168. 

SEQ ID NO: 171 is the full-length cDNA sequence for L773P (also referred to as contig 

56). 

30 SEQ ID NO: 172 is the amino acid sequence encoded by SEQ ID NO: 171. 
SEQ ID NO: 173 is an extended cDNA sequence for L5 19S. 
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SEQ ID NO: 174 is the amino acid sequence encoded by SEQ ID NO: 174. 

SEQ ID NO: 175 is the full-length cDNA sequence for L523S. 

SEQ ID NO: 176 is the amino acid sequence encoded by SEQ ID NO: 175. 

SEQ ID NO: 177 is the determined cDNA sequence for LST-sub5-7A. 
5 SEQ ID NO: 178 is the determined cDNA sequence for LST-sub5-8G. 

SEQ ID NO: 179 is the determined cDNA sequence for LST-sub5-8H. 

SEQ ID NO: 180 is the determined cDNA sequence for LST-sub5-10B. 

SEQ ID NO: 181 is the determined cDNA sequence for LST-sub5-10H. 

SEQ ID NO: 182 is the determined cDNA sequence for LST-sub5-12B. 
1 0 SEQ ID NO: 1 83 is the determined cDNA sequence for LST-sub5- 1 1 C. 

SEQ ID NO: 184 is the determined cDNA sequence for LST-sub6-lc. 

SEQ ID NO: 185 is the determined cDNA sequence for LST-sub6-2f. 

SEQ ID NO: 186 is the determined cDNA sequence for LST-sub6-2G. 

SEQ ID NO: 187 is the determined cDNA sequence for LST-sub6-4d. 
1 5 SEQ ID NO: 1 88 is the determined cDNA sequence for LST-sub6-4e. 

SEQ ID NO: 189 is the determined cDNA sequence for.LST-sub6-4f. 

SEQ ID NO: 190 is the determined cDNA sequence for LST-sub6-3h. 

SEQ ED NO: 191 is the determined cDNA sequence for LST-sub6-5d. 

SEQ ID NO: 192 is the determined cDNA sequence for LST-sub6-5L 
20 SEQ ID NO: 1 93 is the determined cDNA sequence for LST-sub6-6h. 

SEQ ID NO: 194 is the determined cDNA sequence for LST-sub6-7a. 

SEQ ID NO: 195 is the determined cDNA sequence for LST-sub6-8a 

SEQ ID NO: 196 is the determined cDNA sequence for LST-sub6-7d. 

SEQ ID NO: 197 is the determined cDNA sequence for LST-sub6-7e. 
25 SEQ ID NO: 198 is the determined cDNA sequence for LST-sub6-8e. 

SEQ ID NO: 199 is the determined cDNA sequence for LST-sub6-7g. 

SEQ ID NO: 200 is the determined cDNA sequence for LST-sub6-9£ 

SEQ ID NO: 201 is the determined cDNA sequence for LST-sub6-9h. 

SEQ ID NO: 202 is the determined cDNA sequence for LST-sub6-l lb. 
30 SEQ ID NO: 203 is the determined cDNA sequence for LST-sub6-l lc. 

SEQ ID NO: 204 is the determined cDNA sequence for LST-sub6-12c. 
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SEQ ID NO: 205 is the determined cDNA sequence for LST~sub6-12e. 

SEQ ID NO: 206 is the determined cDNA sequence for LST-sub6-12f. 

SEQ ID NO: 207 is the determined cDNA sequence for LST-sub6-l Ig. 

SEQ ID NO: 208 is the determined cDNA sequence for LST-sub6-12g. 
5 SEQ ID NO: 209 is the determined cDNA sequence for LST-sub6-12h. 

SEQ ID NO: 210 is the determined cDNA sequence for LST-sub6~II-la. 

SEQ ID NO: 21 1 is the determined cDNA sequence for LST-sub6-H-2b. 

SEQ ID NO: 212 is the determined cDNA sequence for LST-sub6-II-2g. 

SEQ ID NO: 213 is the determined cDNA sequence for LST-sub6-II-lh. 
10 SEQ ID NO: 2 14 is the determined cDNA sequence for LST^sub6-II-4a. 

SEQ ID NO: 215 is the determined cDNA sequence for LST-sub6-lI-4b. 

SEQ ED NO: 216 is the determined cDNA sequence for LST-sub6-II-3e. 

SEQ ID NO: 217 is the determined cDNA sequence for LST-sub6-II-4f. 

SEQ ID NO: 218 is the determined cDNA sequence for LST-sub6-IL4g. 
15 SEQ ID NO: 219 is the determined cDNA sequence for LST-sub6-II-4h. 

SEQ ID NO: 220 is the determined cDNA sequence for LST-sub6-II-5c. 

SEQ ID NO: 221 is the determined cDNA sequence for LST-sub6-II-5e. 

SEQ ID NO: 222 is the determined cDNA sequence for LST-sub6-II-6f. 

SEQ ID NO: 223 is the determined cDNA sequence for LST-sub6-H-5g. 
20 SEQ ID NO: 224 is the determined cDNA sequence for LST-sub6-II-6g. 

SEQ ID NO: 225 is the amino acid sequence for L528S. 

SEQ ID NO: 226-251 are synthetic peptides derived from L762P. 

SEQ ID NO: 252 is the expressed amino acid sequence of L514S. 

SEQ ID NO: 253 is the DNA sequence corresponding to SEQ ID NO: 252. 
25 SEQ ID NO: 254 is the DNA sequence of a L762P expression construct. 

SEQ ID NO: 255 is the determined cDNA sequence for clone 23785. 

SEQ ID NO: 256 is the determined cDNA sequence for clone 23786. 

SEQ ID NO: 257 is the determined cDNA sequence for clone 23788. 

SEQ ID NO: 258 is the determined cDNA sequence for clone 23790. 
30 SEQ ID NO: 259 is the determined cDNA sequence for clone 23793. 

SEQ ID NO: 260 is the determined cDNA sequence for clone 23794. 
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SEQ ID NO: 261 is the determined cDNA sequence for clone 23795. 

SEQ ID NO: 262 is the determined cDNA sequence for clone 23796. 

SEQ ID NO: 263 is .the determined cDNA sequence for clone 23797. 

SEQ ID NO: 264 is the determined cDNA sequence for clone 23798. 
5 SEQ ID NO: 265 is the determined cDNA sequence for clone 23799. 

SEQ ID NO: 266 is the determined cDNA sequence for clone 23800. 

SEQ ID NO: 267 is the determined cDNA sequence for clone 23802. 

SEQ ID NO: 268 is the determined cDNA sequence for clone 23803. 

SEQ ID NO: 269 is the determined cDNA sequence for clone 23804. 
10 SEQ ID NO: 270 is the determined cDNA sequence for clone 23805. 

SEQ ID NO: 271 is the determined cDNA sequence for clone 23806. 

SEQ ID NO: 272 is the determined cDNA sequence for clone 23807. 

SEQ ID NO: 273 is the determined cDNA sequence for clone 23808. 

SEQ ID NO: 274 is the determined cDNA sequence for clone 23809. 
15 SEQ ID NO: 275 is the determined cDNA sequence for clone 23810. 

SEQ ID NO: 276 is the determined cDNA sequence for clone 2381 1. 

SEQ ID NO: 277 is the determined cDNA sequence for clone 23812. 

SEQ ID NO: 278 is the determined cDNA sequence for clone 23813. 

SEQ ID NO: 279 is the determined cDNA sequence for clone 23815. 
20 SEQ ID NO: 280 is the determined cDNA sequence for clone 25298. 

SEQ ID NO: 281 is the determined cDNA sequence for clone 25299. 

SEQ ID NO: 282 is the determined cDNA sequence for clone 25300. 

SEQ ID NO: 283 is the determined cDNA sequence for clone 25301 

SEQ ID NO: 284 is the determined cDNA sequence for clone 25304 
25 SEQ ID NO: 285 is the determined cDNA sequence for clone 25309. 

SEQ ID NO: 286 is the determined cDNA sequence for clone 25312. 

SEQ ID NO: 287 is the determined cDNA sequence for clone 25317. 

SEQ ID NO:288 is the determined cDNA sequence for clone 25321 . 

SEQ ID NO:289 is the determined cDNA sequence for clone 25323. 
30 SEQ ID NO:290 is the determined cDNA sequence for clone 25327. 

SEQ ID NO:291 is the determined cDNA sequence for clone 25328. 



16 



WO 02/47534 



PCT/US01/47576 



SEQ ID NO:292 is the determined cDNA sequence for clone 25332. 

SEQ ID NO:293 is the determined cDNA sequence for clone 25333. 

SEQ ID NO:294 is the determined cDNA sequence for clone 25336. 

SEQ ID NO:295 is the determined cDNA sequence for clone 25340. 
5 SEQ ID NO:296 is the determined cDNA sequence for clone 25342. 

SEQ ID NO:297 is the determined cDNA sequence for clone 25356. 

SEQ ID NO:298 is the determined cDNA sequence for clone 25357. 

SEQ ID NO:299 is the determined cDNA sequence for clone 25361. 

SEQ ID NO;300 is the determined cDNA sequence for clone 25363. 
1 0 SEQ ID NO:301 is the determined cDNA sequence for clone 25397. 

SEQ ID NO:302 is the determined cDNA sequence for clone 25402. 

SEQ ID NO:303 is the determined cDNA sequence for clone 25403. 

SEQ ID NO:304 is the determined cDNA sequence for clone 25405. 

SEQ ID NO:305 is the determined cDNA sequence for clone 25407. 
15 SEQ ID NO:306 is the determined cDNA sequence for clone 25409. 

SEQ ID NO:307 is the determined cDNA sequence for clone 25396. 

SEQ ID NO:308 is the determined cDNA sequence for clone 25414. 

SEQ ID NO:309 is the determined cDNA sequence for clone 25410. 

SEQ ID NO.310 is the determined cDNA sequence for clone 25406. 
20 SEQ ID NO:3 1 1 is the determined cDNA sequence for clone 25306. 

SEQ ID NO:312 is the determined cDNA sequence for clone 25362. 

SEQ ID NO:313 is the determined cDNA sequence for clone 25360. 

SEQ ID NO:314 is the determined cDNA sequence for clone 25398. 

SEQ ID NO:3 15 is the determined cDNA sequence for clone 25355. 
25 SEQ ID NO:3 16 is the determined cDNA sequence for clone 2535 1 . 

SEQ ED NO:3 1 7 is the determined cDNA sequence for clone 2533 1 . 

SEQ ID NO:3 18 is the determined cDNA sequence for clone 25338. 

SEQ ID NO:3 19 is the determined cDNA sequence for clone 25335. 

SEQ ID NO:320 is the determined cDNA sequence for clone 25329. 
30 SEQ ID NO:32 1 is the determined cDNA sequence for clone 25324. 

SEQ ID NO:322 is the determined cDNA sequence for clone 25322. 
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SEQ ID NO:323 is the determined cDNA sequence for clone 253 19. 
SEQ ID NO:324 is the determined cDNA sequence for clone 25316. 
SEQ ID NO:325 is the determined cDNA sequence for clone 2531 1. 
SEQ ID NO:326 is the determined cDNA sequence for clone 253 1 0. 
5 SEQ ID NO:327 is the determined cDNA sequence for clone 25302. 
SEQ ID NO:328 is the determined cDNA sequence for clone 25315. 
SEQ ID NO:329 is the determined cDNA sequence for clone 25308. 
SEQ ID NO:330 is the determined cDNA sequence for clone 25303. 
SEQ ID NO:33 1-337 are the cDNA sequences of isoforms of the p53 tumor suppressor 
1 0 homologue, p63 (also referred to as L530S). 

SEQ ID NO:338-344 are the amino acid sequences encoded by SEQ ID NO:331-337, 
respectively 

SEQ ID NO:345 is a second cDNA sequence for the antigen L763P. 
SEQ ID NO:346 is the amino acid sequence encoded by the sequence of SEQ ID NO: 
15 345. 

SEQ ID NO:347 is a determined full-length cDNA sequence for L523S. 
SEQ ID NO:348 is the amino acid sequence encoded by SEQ ID NO: 347. 
SEQ ID NO:349 is the cDNA sequence encoding the N-terminal portion of L773P. 
SEQ ID NO:350 is the amino acid sequence of the N-terminal portion of L773P. 
20 SEQ ID NO:351 is the DNA sequence for a fusion of Ral2 and the N-terminal portion 
ofL763P. 

SEQ ID NO:352 is the amino acid sequence of the fusion of Ral2 and the N-terminal 
portion of L763P. 

SEQ ID NO:353 is the DNA sequence for a fusion of Ral2 and the C-terminal portion 
25 ofL763P. 

SEQ ID NO:354 is the amino acid sequence of the fusion of Ral2 and the C-terminal 
portion ofL763P. 
SEQ ID NO:355 is a primer. 
SEQ ID NO:356 is a primer. 
30 SEQ ID NO:357 is the protein sequence of expressed recombinant L762P. 
SEQ ID NO:358 is the DNA sequence of expressed recombinant L762P. 
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SEQ ID NO:359 is a primer. 
SEQ ED NO:360 is a primer. 

SEQ ID NO.361 is the protein sequence of expressed recombinant L773P A. 
SEQ ID NO:362 is the DNA sequence of expressed recombinant L773P A. 
5 SEQ ID NO:363 is an epitope derived from clone L773P polypeptide. 

SEQ ID NO:364 is a polynucleotide encoding the polypeptide of SEQ ID NO:363. 
SEQ ID NO:365 is an epitope derived from clone L773P polypeptide. 
SEQ ID NO:366 is a polynucleotide encoding the polypeptide of SEQ ID NO:365. 
SEQ ID NO:367 is an epitope consisting of amino acids 571-590 of SEQ ID NO:161, 
10 clone L762P. 

SEQ ID NO:368 is the full-length DNA sequence for contig 13 (SEQ ID NO:125), also 
referred to as L761P. 

SEQ ED NO:369 is the protein sequence encoded by the DNA sequence of SEQ ED 
NO:368. 

15 SEQ ED NO:370 is an L762P DNA sequence from nucleotides 2071-2130. 

SEQ ED NO:371 is an L762P DNA sequence from nucleotides 1441-1500. 

SEQ ED NO:372 is an L762P DNA sequence from nucleotides 1936-1955. 

SEQ ED NO:373 is an L762P DNA sequence from nucleotides 2620-2679. 

SEQ ED NO:374 is an L762P DNA sequence from nucleotides 1801-1860. 
20 SEQ ED NO:375 is an L762P DNA sequence from nucleotides 153 1-1591 . 

SEQ ID NO:376 is the amino acid sequence of the L762P peptide encoded by SEQ ID 

NO:373. 

SEQ ID NO:377 is the amino acid sequence of the L762P peptide encoded by SEQ ED 
NO:370. 

25 SEQ ID NO:378 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:372. 

SEQ ID NO:379 is the amino acid sequence of the L762P peptide encoded by SEQ ED 
NO:374. 

SEQ ID NO-.380 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
30 NO:371. 
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SEQ ED NO:381 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:375. 

SEQ ID NO:382 is the amino acid sequence of an epitope of L762P. 

SEQ ID NO:383-386 are PGR primers. 
5 SEQ ID NO:387-395 are the amino acid sequences of L773P peptides. 

SEQ ID NO:396-419 are the amino acid sequences of L523S peptides, 

SEQ ID NO:420 is the determined cDNA sequence for clone #19014. 

SEQ ID NO:421 is the forward primer PDM-278 for the L514S-13160 coding region. 

SEQ ID NO:422 is the reverse primer PDM-278 for the L514S-13160 coding region. 
10 SEQ ID NO:423 is the amino acid sequence for the expressed recombinant L514S. 

SEQ ID NO:424 is the DNA coding sequence for the recombinant L5 14S. 

SEQ ID NO:425 is the forward primer PDM-414 for the L523S coding region. 

SEQ ID NO:426 is the reverse primer PDM-414 for the L523S coding region. 

SEQ ID NO:427 is the amino acid sequence for the expressed recombinant L523S. 
1 5 SEQ ID NO:428 is the DNA coding sequence for the recombinant L523S. 

SEQ ID NO:429 is the reverse primer PDM-279 for the L762PA coding region. 

SEQ ID NO:430 is the amino acid sequence for the expressed recombinant L762PA. 

SEQ ID NO:43 1 is the DNA coding sequence for the recombinant L762PA. 

SEQ ID NO:432 is the reverse primer PDM-300 for the L773P coding region. 
20 SEQ ID NO:433 is the amino acid sequence of the expressed recombinant L773P. 

SEQ ID NO:434 is the DNA coding sequence for the recombinant L773P. 

SEQ ID NO:435 is the forward primer for TCR Valpha8. 

SEQ ID NO:436 is the reverse primer for TCR Valpha8. 

SEQ ID NO:437 is the forward primer for TCR Vbeta8. 
25 SEQ ID NO:438 is the reverse primer for TCR Vbeta8. 

SEQ ID NO:439 is the TCR Valpha DNA sequence of the TCR clone specific for the 

lung antigen L762P. 

SEQ ID NO:440 is the TCR Vbeta DNA sequence of the TCR clone specific for the 
lung antigen L762P. 
30 SEQ ID NO:441 is the amino acid sequence of L763 peptide #2684. 
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SEQ ID NO:442 is the predicted full-length cDNA for the cloned partial sequence of 
clone L529S (SEQ ID NO:106). 

SEQ ID NO:443 is the deduced amino acid sequence encoded by SEQ ID NO:442. 
SEQ ID NO:444 is the forward primer PDM-734 for the coding region of clone L523S. 
5 SEQ ID NO:445 is the reverse primer PDM-735 for the coding region of clone L523S. 
SEQ ID NO:446 is the amino acid sequence for the expressed recombinant L523S. 
SEQ ID NO:447 is the DNA coding sequence for the recombinant L523S. 
SEQ ID NO:448 is another forward primer PDM-733 for the coding region of clone 
L523S. 

10 SEQ ID NO:449 is the amino acid sequence for a second expressed recombinant L523S. 
SEQ ID NO:450 is the DNA coding sequence for a second recombinant L523S. 
SEQ ID NO:451 corresponds to amino acids 86-110, an epitope of L514S-specific in 
the generation of antibodies. 

SEQ ID NO:452 corresponds to amino acids 21-45, an epitope of L514S-specific in the 
1 5 generation of antibodies. 

SEQ ID NO:453 corresponds to amino acids 121-135, an epitope of L514S-specific in 
the generation of antibodies. ' 

SEQ ID NO:454 corresponds to amino acids 440-460, an epitope of L523S-specific in 
the generation of antibodies. 
20 SEQ ID NO:455 corresponds to amino acids 156-175, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:456 corresponds to amino acids 326-345, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:457 corresponds to amino acids 40-59, an epitope of L523S-specific in the 
25 generation of antibodies. 

SEQ ID NO:458 corresponds to amino acids 80-99, an epitope of L523S-specific in the 
generation of antibodies. 

SEQ ID NO:459 corresponds to amino acids 160-179, an epitope of L523S-specific in 
the generation of antibodies. 
30 SEQ ID NO:460 corresponds to amino acids 180-199, an epitope of L523S-specific in 
the generation of antibodies. 
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SEQ ID NO:461 corresponds to amino acids 320-339, an epitope of L523S-specific in 
the generation of antibodies. » 
SEQ ID NO:462 corresponds to amino acids 340-359, an epitope of L523S-specific in 
the generation of antibodies. 
5 SEQ ID NO:463 corresponds to amino acids 370-389, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:464 corresponds to amino acids 380-399, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ED NO:465 corresponds to amino acids 37-55, an epitope of L523S-recognized by 
1 0 the L523 S-specific CTL line 6B 1 . 

SEQ ID NO:466 corresponds to amino acids 41-51, the mapped antigenic epitope of 
L523S-recognized by the L523 S-specific CTL line 6B1 . 

SEQ ID NO:467 corresponds to the DNA sequence which encodes SEQ ID NO:466. 
SEQ ID NO:468 corresponds to the amino acids of peptide 16, 17 of hL523S. 
1 5 SEQ ID NO:469 corresponds to the amino acids of peptide 16, 17 of mL523S 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed generally to compositions and their use 
in the therapy and diagnosis of cancer, particularly lung cancer. As described further 
below, illustrative compositions of the present invention include, but are not restricted 
20 to, polypeptides, particularly immunogenic polypeptides, polynucleotides encoding such 
polypeptides, antibodies and other binding agents, antigen presenting cells (APCs) and 
immune system cells (e.g., T cells). 

The practice of the present invention will employ, unless indicated 
specifically to the contrary, conventional methods of virology, immunology, 

25 microbiology, molecular biology and recombinant DNA techniques within the skill of 
the art, many of which are described below for the purpose of illustration. Such 
techniques are explained fully in the literature. See, e.g., Sambrook, et al. Molecular 
Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning: 
A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & II (D. 

30 Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); Nucleic Acid 
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Hybridization (B. Haines & S. Higgins, eds., 1985); Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed., 1986); Perbal, 
A Practical Guide to Molecular Cloning (1984). 

All publications, patents and patent applications cited herein, whether 
5 supra or infra, are hereby incorporated by reference in their entirety. 

As used in this specification and the appended claims, the singular forms 
"a," "an" and "the" include plural references unless the content clearly dictates 
otherwise. 

Polypeptide Compositions 

10 As used herein, the term "polypeptide" " is used in its conventional 

meaning, Le., as a sequence of amino acids. The polypeptides are not limited to a 
specific length of the product; thus, peptides, oligopeptides, and proteins are included 
within the definition of polypeptide, and such terms may be used interchangeably herein 
unless specifically indicated otherwise. This term also does not refer to or exclude post- 

15 expression modifications of the polypeptide, for example, glycosylations, acetylations, 
phosphorylations and the like, as well as other modifications known in the art, both 
naturally occurring and non-naturally occurring. A polypeptide may be an entire 
protein, or a subsequence thereof. Particular polypeptides of interest in the context of 
this invention are amino acid subsequences comprising epitopes, z.e., antigenic 

20 determinants substantially responsible for the immunogenic properties of a polypeptide 
and being capable of evoking an immune response. 

Particularly illustrative polypeptides of the present invention comprise 
those encoded by a polynucleotide sequence set forth in any one of SEQ ID NO: 1-3, 6- 
8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80- 

25 82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 
210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 
420, 424, 428, 431, 434, 442, 447, 450 and 467, or a sequence that hybridizes under 
moderately stringent conditions, or, alternatively, under highly stringent conditions, to a 

30 polynucleotide sequence set forth in any one of SEQ ID NO: 1-3, 6-8, 10-13, 15-27, 29, 
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30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107- 
109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 
160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207,209,210,213,214, 
217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 
5 431, 434, 442, 447, 450 and 467. Certain illustrative polypeptides of the invention 
comprise amino acid sequences as set forth in any one of SEQ ID NO:152, 155, 156, 
165, 166, 169, 170, 172, 174, 176, 226-252, 338-344, 346, 350, 357, 361, 363, 365, 
367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 449, 451-466 and 468- 
469. 

10 The polypeptides of the present invention are sometimes herein referred 

to as lung tumor proteins or lung tumor polypeptides, as an indication that their 
identification has been based at least in part upon their increased levels of expression in 
lung tumor samples. Thus, a "lung tumor polypeptide" or "lung tumor protein, 1 ' refers 
generally to a polypeptide sequence of the present invention, or a polynucleotide 

15 sequence encoding such a polypeptide, that is expressed in a substantial proportion of 
lung tumor samples, for example preferably greater than about 20%, more preferably 
greater than about 30%, and most preferably greater than about 50% or more of lung 
tumor samples tested, at a level that is at least two fold, and preferably at least five fold, 
greater than the level of expression in normal tissues, as determined using a 

20 representative assay provided herein. A lung tumor polypeptide sequence of the 
invention, based upon its increased level of expression in tumor cells, has particular 
utility both as a diagnostic marker as well as a therapeutic target, as further described 
below. 

In certain preferred embodiments, the polypeptides of the invention are 
25 immunogenic, Le. 9 they react detectably within an immunoassay (such as an ELISA or 
T-cell stimulation assay) with antisera and/or T-cells from a patient with lung cancer. 
Screening for immunogenic activity can be performed using techniques well known to 
the skilled artisan. For example, such screens can be performed using methods such as 
those described in Harlow and Lane, Antibodies: A Laboratory Manual Cold Spring 
30 Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be 
immobilized on a solid support and contacted with patient sera to allow binding of 
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antibodies within the sera to the immobilized polypeptide. Unbound sera may then be 
removed and bound antibodies detected using, for example, 125 I-labeled Protein A. 

As would be recognized by the skilled artisan, immunogenic portions of 
the polypeptides disclosed herein are also encompassed by the present invention. An 
5 "immunogenic portion," as used herein, is a fragment of an immunogenic polypeptide 
of the invention that itself is immunologically reactive (le., specifically binds) with the 
B-cells and/or T-cell surface antigen receptors that recognize the polypeptide. 
Immunogenic portions may generally be identified using well known techniques, such 
as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 

10 1993) and references cited therein. Such techniques include screening polypeptides for 
the ability to react with antigen-specific antibodies, antisera and/or T-cell lines or 
clones. As used herein, antisera and antibodies are "antigen-specific" if they 
specifically bind to an antigen (i.e., they react with the protein in an ELISA or other 
immunoassay, and do not react detectably with unrelated proteins). Such antisera and 

1 5 antibodies may be prepared as described herein, and using well-known techniques. 

In one preferred embodiment, an immunogenic portion of a polypeptide 
of the present invention is a portion that reacts with antisera and/or T-cells at a level that 
is not substantially less than the reactivity of the full-length polypeptide (e.g., in an 
ELISA and/or T-cell reactivity assay). Preferably, the level of immunogenic activity of 

20 the immunogenic portion is at least about 50%, preferably at least about 70% and most 
preferably greater than about 90% of the immunogenicity for the full-length 
polypeptide. In some instances, preferred immunogenic portions will be identified that 
have a level of immunogenic activity greater than that of the corresponding full-length 
polypeptide, e.g., having greater than about 100% or 150% or more immunogenic 

25 activity. 

In certain other embodiments, illustrative immunogenic portions may 
include peptides in which an N-terminal leader sequence and/or transmembrane domain 
have been deleted. Other illustrative immunogenic portions will contain a small N- 
and/or C-terminal deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), 
30 relative to the mature protein. 
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In another embodiment, a polypeptide composition of the invention may 
also comprise one or more polypeptides that are immunologically reactive with T cells 
and/or antibodies generated against a polypeptide of the invention, particularly a 
polypeptide having an amino acid sequence disclosed herein, or to an immunogenic 
5 fragment or variant thereof 

In another embodiment of the invention, polypeptides are provided that 
comprise one or more polypeptides that are capable of eliciting T cells and/or antibodies 
that are immunologically reactive with one or more polypeptides described herein, or 
one or more polypeptides encoded by contiguous nucleic acid sequences contained in 

10 the polynucleotide sequences disclosed herein, or immunogenic fragments or variants 
thereof, or to one or more nucleic acid sequences which hybridize to one or more of 
these sequences under conditions of moderate to high stringency. 

The present invention, in another aspect, provides polypeptide fragments 
comprising at least about 5, 10, 15, 20, 25, 50, or 100 contiguous amino acids, or more, 

15 including all intermediate lengths, of a polypeptide compositions set forth herein, such 
as those set forth in SEQ ID NO:152, 155, 156, 165, 166, 169, 170, 172, 174, 176, 226- 
252, 338-344, 346, 350, 357, 361, 363, 365, 367, 369, 376-382 and 387-419, 441, 443, 
446, 449, 451-466 and 468-469, or those encoded by a polynucleotide sequence set 
forth in a sequence of SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 

20 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 
128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 
182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 
345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 
and 467. 

25 In another aspect, the present invention provides variants of the 

polypeptide compositions described herein. Polypeptide variants generally 
encompassed by the present invention will typically exhibit at least about 70%, 75%, 
80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more identity 
(determined as described below), along its length, to a polypeptide sequences set forth 

30 herein. 
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In one preferred embodiment, the polypeptide fragments and variants 
provide by the present invention are immunologically reactive with an antibody and/or 
T-cell that reacts with a full-length polypeptide specifically set for the herein. 

In another preferred embodiment, the polypeptide fragments and variants 
5 provided by the present invention exhibit a level of immunogenic activity of at least 
about 50% 5 preferably at least about 70%, and most preferably at least about 90% or 
more of that exhibited by a full-length polypeptide sequence specifically set forth 
herein. 

A polypeptide 'Variant," as the term is used herein, is a polypeptide that 
10 typically differs from a polypeptide specifically disclosed herein in one or more 
substitutions, deletions, additions and/or insertions. Such variants may be naturally 
occurring or may be synthetically generated, for example, by modifying one or more of 
the above polypeptide sequences of the invention and evaluating their immunogenic 
activity as described herein and/or using any of a number of techniques well known in 
15 the art. 

For example, certain illustrative variants of the polypeptides of the 
invention include those in which one or more portions, such as an N-terminal leader 
sequence or transmembrane domain, have been removed. Other illustrative variants 
include variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino 

20 acids) has been removed from the N- and/or C-terminal of the mature protein. 

In many instances, a variant will contain conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 

25 polypeptide to be substantially unchanged. As described above, modifications may be 
made in the structure of the polynucleotides and polypeptides of the present invention 
and still obtain a functional molecule that encodes a variant or derivative polypeptide 
with desirable characteristics, e.g., with immunogenic characteristics. When it is 
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or 

30 even an improved, immunogenic variant or portion of a polypeptide of the invention, 
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one skilled in the art will typically change one or more of the codons of the encoding 
DNA sequence according to Table 1. 

For example, certain amino acids may be substituted for other amino 
acids in a protein structure without appreciable loss of interactive binding capacity with 
structures such as, for example, antigen-binding regions of antibodies or binding sites 
on substrate molecules. Since it is the interactive capacity and nature of a protein that 
defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence, and, of course, its underlying DNA 
coding sequence, and nevertheless obtain a protein with like properties. It is thus 
contemplated that various changes may be made in the peptide sequences of the 
disclosed compositions, or corresponding DNA sequences which encode said peptides 
without appreciable loss of their biological utility or activity. 
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Table 1 



Amino Acids Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


UUC 


UUU 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


lie 


I 


AUA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


CUU 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CGA 


ccc 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


W 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











In making such changes, the hydropathic index of amino acids may be 
considered. The importance of the hydropathic amino acid index in conferring 
5 interactive biologic function on a protein is generally understood in the art (Kyte and 
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein with other 
molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
10 the like. Each amino acid has been assigned a hydropathic index on the basis of its 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are: 
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isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); 
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-A5). 
5 It is known in the art that certain amino acids may be substituted by other 

amino acids having a similar hydropathic index or score and still result in a protein with 
similar biological activity, ie. still obtain a biological functionally equivalent protein. 
In making such changes, the substitution of amino acids whose hydropathic indices are 
within +2 is preferred, those within +1 are particularly preferred, and those within ±0.5 

10 are even more particularly preferred. It is also understood in the art that the substitution 
of like amino acids can be made effectively on the basis of hydrophilicity. U. S. Patent 
4,554,101 (specifically incorporated herein by reference in its entirety), states that the 
greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of 
its adjacent amino acids, correlates with a biological property of the protein. 

15 As detailed in U. S. Patent 4,554,101, the following hydrophilicity values 

have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate 
(+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); threonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine 
(-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (- 

20 2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent protein. In 
such changes, the substitution of amino acids whose hydrophilicity values are within ±2 
is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even 

25 more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based 
on the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that 
take various of the foregoing characteristics into consideration are well known to those 

30 of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. 
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In addition, any polynucleotide may be further modified to increase 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
flanking sequences at the 5' and/or 3' ends; the use of phosphorothioate or 2 ! O-methyl 
rather than phosphodiesterase linkages in the backbone; and/or the inclusion of 
5 nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl- 
methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and 
uridine. 

Amino acid substitutions may further be made on the basis of similarity 
in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic 

10 nature of the residues. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids with uncharged polar head groups having similar hydrophilicity values 
include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may 

15 represent conservative changes include: (l)ala, pro, gly, glu, asp, gin, asn, ser, thr; 
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, tip, 
his. A variant may also, or alternatively, contain non-conservative changes. In a 
preferred embodiment, variant polypeptides differ from a native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 

20 alternatively) be modified by, for example, the deletion or addition of amino acids that 
have minimal influence on the immunogenicity, secondary structure and hydropathic 
nature of the polypeptide. 

As noted above, polypeptides may comprise a signal (or leader) sequence 
at the N-terminal end of the protein, which co-translationally or post-translationally 

25 directs transfer of the protein. The polypeptide may also be conjugated to a linker or 
other sequence for ease of synthesis, purification or identification of the polypeptide 
(e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For 
example, a polypeptide may be conjugated to an immunoglobulin Fc region. 

When comparing polypeptide sequences, two sequences are said to be 

30 "identical" if the sequence of amino acids in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
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sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
window 3 ' as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
5 reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 

10 alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 

15 vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, 
E.D. (1971) Comb. Theor 77:105; Santou, N. Nes, M. (1987) Mot Biot Evot 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy ; Freeman Press, San Francisco, CA; Wilbur, W.J. and 

20 Lipman, D.J. (1983) Proc. Natl. Acad, Sci. USA 50:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add. API. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mot Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 

25 Proc. Natl Acad Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFTT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
- Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
30 percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl Acids Res. 25:3389-3402 
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and Altschul et al. (1990) 1 Mol Biol 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is publicly available through the National Center for 

5 Biotechnology Information. For amino acid sequences, a scoring matrix can be used to 
calculate the cumulative score. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X from its maximum 
achieved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue alignments; or the end of either sequence is 

10 reached. The BLAST algorithm parameters W, T and X determine the sensitivity and 
speed of the alignment. 

In one preferred approach, the "percentage of sequence identity 3 ' is 
determined by comparing two optimally aligned sequences over a window of 
comparison of at least 20 positions, wherein the portion of the polypeptide sequence in 

15 the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent 
or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference 
sequences (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. The percentage is calculated by determining the number of positions at 
which the identical amino acid residue occurs in both sequences to yield the number of 

20 matched positions, dividing the number of matched positions by the total number of 
positions in the reference sequence (i.e., the window size) and multiplying the results by 
100 to yield the percentage of sequence identity. 

Within other illustrative embodiments, a polypeptide may be a fusion 
polypeptide that comprises multiple polypeptides as described herein, or that comprises 

25 at least one polypeptide as described herein and an unrelated sequence, such as a known 
tumor protein. A fusion partner may, for example, assist in providing T helper epitopes 
(an immunological fusion partner), preferably T helper epitopes recognized by humans, 
or may assist in expressing the protein (an expression enhancer) at higher yields than the 
native recombinant protein. Certain preferred fusion partners are both immunological 

30 and expression enhancing fusion partners. Other fusion partners may be selected so as 
to increase the solubility of the polypeptide or to enable the polypeptide to be targeted to 
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desired intracellular compartments. Still further fusion partners include affinity tags, 
which facilitate purification of the polypeptide. 

Fusion polypeptides may generally be prepared using standard 
techniques, including chemical conjugation. Preferably, a fusion polypeptide is 
5 expressed as a recombinant polypeptide, allowing the production of increased levels, 
relative to a non-fused polypeptide, in an expression system. Briefly, DNA sequences 
encoding the polypeptide components may be assembled separately, and ligated into an 
appropriate expression vector. The 3' end of the DNA sequence encoding one 
polypeptide component is ligated, with or without a peptide linker, to the 5' end of a 

10 DNA sequence encoding the second polypeptide component so that the reading frames 
of the sequences are in phase. This permits translation into a single fusion polypeptide 
that retains the biological activity of both component polypeptides. 

A peptide linker sequence may be employed to separate the first and 
second polypeptide components by a distance sufficient to ensure that each polypeptide 

15 folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion polypeptide using standard techniques well known in the 
art. Suitable peptide linker sequences may be chosen based on the following factors: 
(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 
secondary structure that could interact with functional epitopes on the first and second 

20 polypeptides; and (3) the lack of hydrophobic or charged residues that might react with 
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 
used in the linker sequence. Amino acid sequences which may be usefully employed as 
linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., 

25 Proc. Natl Acad Sci USA 55:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. 
Patent No. 4,751,180. The linker sequence may generally be from 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 
separate the functional domains and prevent steric interference. 

30 The ligated DNA sequences are operably linked to suitable 

transcriptional or translational regulatory elements. The regulatory elements 
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responsible for expression of DNA are located only 5' to the DNA sequence encoding 
the first polypeptides. Similarly, stop codons required to end translation and 
transcription termination signals are only present 3' to the DNA sequence encoding the 
second polypeptide. 

5 The fusion polypeptide can comprise a polypeptide as described herein 

together with an unrelated immunogenic protein, such as an immunogenic protein 
capable of eliciting a recall response. Examples of such proteins include tetanus, 
tuberculosis and hepatitis proteins {see, for example, Stoute et aL New Engl J. Med. 9 
556:86-91, 1997). 

10 In one preferred embodiment, the immunological fusion partner is 

derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 
fragment. Ral2 compositions and methods for their use in enhancing the expression 
and/or immunogenicity of heterologous polynucleotide/polypeptide sequences is 
described in U.S. Patent Application 60/158,585, the disclosure of which is 

15 incorporated herein by reference in its entirety. Briefly, Ral2 refers to a polynucleotide 
region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. 
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent 
and avirulent strains of M tuberculosis. The nucleotide sequence and amino acid 
sequence of MTB32A have been described (for example, U.S. Patent Application 

20 60/158,585; see also, Skeiky et aL, Infection and Immun (1999) 67:3998-4007, 
incorporated herein by reference). C-terminal fragments of the MTB32A coding 
sequence express at high levels and remain as a soluble polypeptides throughout the 
purification process. Moreover, Ral2 may enhance the immunogenicity of heterologous 
immunogenic polypeptides with which it is fused. One preferred Ral2 fusion 

25 polypeptide comprises a 14 KD C-terminal fragment corresponding to amino acid 
residues 192 to 323 of MTB32A. 

Other preferred Ral2 polynucleotides generally comprise at least about 
15 consecutive nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, 
at least about 100 nucleotides, at least about 200 nucleotides, or at least about 300 

30 nucleotides that encode a portion of a Ral2 polypeptide. 
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Ral2 polynucleotides may comprise a native sequence (Le., an 
endogenous sequence that encodes a Ral2 polypeptide or a portion thereof) or may 
comprise a variant of such a sequence. Ral2 polynucleotide variants may contain one 
or more substitutions, additions, deletions and/or insertions such that the biological 
5 activity of the encoded fusion polypeptide is not substantially diminished, relative to a 
fusion polypeptide comprising a native Ral2 polypeptide. Variants preferably exhibit at 
least about 70% identity, more preferably at least about 80% identity and most 
preferably at least about 90% identity to a polynucleotide sequence that encodes a native 
Ral 2 polypeptide or a portion thereof. 

10 Within other preferred embodiments, an immunological fusion partner is 

derived from protein D, a surface protein of the gram-negative bacterium Haemophilus 
influenza B (WO 91/18926). Preferably, a protein D derivative comprises 
approximately the first third of the protein (e.g., the first N-terminal 100-110 amino 
acids), and a protein D derivative may be lipidated. Within certain preferred 

15 embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the 
N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to 
increase the expression level in E. coli (thus functioning as an expression enhancer). 
The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. 
Other fusion partners include the non-structural protein from influenzae virus, NS1 

20 (hemaglutinin). Typically, the N-terminal 81 amino acids are used, although different 
fragments that include T-helper epitopes may be used. 

In another embodiment, the immunological fusion partner is the protein 
known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is 
derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine 

25 amidase known as amidase LYTA (encoded by the LytA gene; Gene ¥3:265-292, 1986). 
LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan 
backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to 
the choline or to some choline analogues such as DEAE. This property has been 
exploited for the development of E. coli C-LYTA expressing plasmids useful for 

30 expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA 
fragment at the amino terminus has been described (see Biotechnology 70:795-798, 
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1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated 
into a fusion polypeptide. A repeat portion is found in the C-terminal region starting at 
residue 178. A particularly preferred repeat portion incorporates residues 188-305. 

Yet another illustrative embodiment involves fusion polypeptides, and 
5 the polynucleotides encoding them, wherein the fusion partner comprises a targeting 
signal capable of directing a polypeptide to the endosomal/lysosomal compartment, as 
described in U.S. Patent No. 5,633,234. An immunogenic polypeptide of the invention, 
when fused with this targeting signal, will associate more efficiently with MHC class II 
molecules and thereby provide enhanced in vivo stimulation of CD4 + T-cells specific 

1 0 for the polypeptide. 

Polypeptides of the invention are prepared using any of a variety of well 
known synthetic and/or recombinant techniques, the latter of which are further 
described below. Polypeptides, portions and other variants generally less than about 
150 amino acids can be generated by synthetic means, using techniques well known to 

15 those of ordinary skill in the art. In one illustrative example, such polypeptides are 
synthesized using any of the commercially available solid-phase techniques, such as the 
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 55:2149-2146, 1963. 
Equipment for automated synthesis of polypeptides is commercially available from 

20 suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and 
may be operated according to the manufacturer's instructions. 

In general, polypeptide compositions (including fusion polypeptides) of 
the invention are isolated. An "isolated" polypeptide is one that is removed from its 
original environment. For example, a naturally-occurring protein or polypeptide is 

25 isolated if it is separated from some or all of the coexisting materials in the natural 
system. Preferably, such polypeptides are also purified, e.g., are at least about 90% 
pure, more preferably at least about 95% pure and most preferably at least about 99% 
pure. 

Polynucleotide Compositions 
30 The present invention, in other aspects, provides polynucleotide 

compositions. The terms "DNA" and "polynucleotide" are used essentially 
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interchangeably herein to refer to a DNA molecule that has been isolated free of total 
genomic DNA- of a particular species. "Isolated," as used herein, means that a 
polynucleotide is substantially away from other coding sequences, and that the DNA 
molecule does not contain large portions of unrelated coding DNA, such as large 
5 chromosomal fragments or other functional genes or polypeptide coding regions. Of 
course, this refers to the DNA molecule as originally isolated, and does not exclude 
genes or coding regions later added to the segment by the hand of man. 

As will be understood by those skilled in the art, the polynucleotide 
compositions of this invention can include genomic sequences, extra-genomic and 

10 plasmid-encoded sequences and smaller engineered gene segments that express, or may 
be adapted to express, proteins, polypeptides, peptides and the like. Such segments may 
be naturally isolated, or modified synthetically by the hand of man. 

As will be also recognized by the skilled artisan, polynucleotides of the 
invention may be single-stranded (coding or antisense) or double-stranded, and may be 

15 DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules may include 
HnRNA molecules, which contain introns and correspond to a DNA molecule in a one- 
to-one manner, and mRNA molecules, which do not contain introns. Additional coding 
or non-coding sequences may, but need not, be present within a polynucleotide of the 
present invention, and a polynucleotide may, but need not, be linked to other molecules 

20 and/or support materials. 

Polynucleotides may comprise a native sequence (i.e, an endogenous 
sequence that encodes a polypeptide/protein of the invention or a portion thereof) or 
may comprise a sequence that encodes a variant or derivative, preferably and 
immunogenic variant or derivative, of such a sequence. 

25 Therefore, according to another aspect of the present invention, 

polynucleotide compositions are provided that comprise some or all of a polynucleotide 
sequence set forth in any one of SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 
51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 
125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 

30 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 
253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 434, 
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442, 447, 450 and 467, complements of a polynucleotide sequence set forth in any one 
of SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59; 61-69, 
71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 
142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 
5 193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 
364, 365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, and degenerate 
variants of a polynucleotide sequence set forth in any one of SEQ ID NO:l-3, 6-8, 10- 
13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 
86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 
10 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 
213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 
424, 428, 431, 434, 442, 447, 450 and 467. In certain preferred embodiments, the 
polynucleotide sequences set forth herein encode immunogenic polypeptides, as 
described above. 

15 In other related embodiments, the present invention provides 

polynucleotide variants having substantial identity to the sequences disclosed herein in 
SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 
73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 
144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 

20 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 
' 365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, for example those 
comprising at least 70% sequence identity, preferably at least 75%, 80%, 85%, 90%, 
95%, 96%, 97%, 98%, or 99% or higher, sequence identity compared to a 
polynucleotide sequence of this invention using the methods described herein, (e.g., 

25 BLAST analysis using standard parameters, as described below). One skilled in this art 
will recognize that these values can be appropriately adjusted to determine 
corresponding identity of proteins encoded by two nucleotide sequences by taking into 
account codon degeneracy, amino acid similarity, reading frame positioning and the 
like. 

30 Typically, polynucleotide variants will contain one or more substitutions, 

additions, deletions and/or insertions, preferably such that the immunogenicity of the 
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polypeptide encoded by the variant polynucleotide is not substantially diminished 
relative to a polypeptide encoded by a polynucleotide sequence specifically set forth 
herein). The term 'Variants" should also be understood to encompasses homologous 
genes of xenogenic origin. 
5 In additional embodiments, the present invention provides 

polynucleotide fragments comprising various lengths of contiguous stretches of 
sequence identical to or complementary to one or more of the sequences disclosed 
herein. For example, polynucleotides are provided by this invention that comprise at 
least about 10, 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more 

10 contiguous nucleotides of one or more of the sequences disclosed herein as well as all 
intermediate lengths there between. It will be readily understood that "intermediate 
lengths", in this context, means any length between the quoted values, such as 16, 17, 
18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, 
etc.; 150, 151, 152, 153, etc.; including all integers through 200-500; 500-1,000, and the 

15 like. 

In another embodiment of the invention, polynucleotide compositions are 
provided that are capable of hybridizing under moderate to high stringency conditions to 
a polynucleotide sequence provided herein, or a fragment thereof, or a complementary 
sequence thereof. Hybridization techniques are well known in the art of molecular 

20 biology. For purposes of illustration, suitable moderately stringent conditions for 
testing the hybridization of a polynucleotide of this invention with other polynucleotides 
include prewashing in a solution of 5 X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); 
hybridizing at 50°C-60°C, 5 X SSC, overnight; followed by washing twice at 65°C for 
20 minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% SDS. One skilled in 

25 the art will understand that the stringency of hybridization can be readily manipulated, 
such as by altering the salt content of the hybridization solution and/or the temperature 
at which the hybridization is performed. For example, in another embodiment, suitable 
highly stringent hybridization conditions include those described above, with the 
exception that the temperature of hybridization is increased, e.g., to 60-65°C or 65- 

30 70°C. 
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In certain preferred embodiments, the polynucleotides described above, 
e.g., polynucleotide variants, fragments and hybridizing sequences, encode polypeptides 
that are immunologically cross-reactive with a polypeptide sequence specifically set 
forth herein. In other preferred embodiments, such polynucleotides encode 
5 polypeptides that have a level of immunogenic activity of at least about 50%, preferably 
at least about 70%, and more preferably at least about 90% of that for a polypeptide 
sequence specifically set forth herein. 

The polynucleotides of the present invention, or fragments thereof, 
regardless of the length of the coding sequence itself, may be combined with other DNA 

10 sequences, such as promoters, polyadenylation signals, additional restriction enzyme 
sites, multiple cloning sites, other coding segments, and the like, such that their overall 
length may vary considerably. It is therefore contemplated that a nucleic acid fragment 
of almost any length may be employed, with the total length preferably being limited by 
the ease of preparation and use in the intended recombinant DNA protocol. For 

15 example, illustrative polynucleotide segments with total lengths of about 10,000, about 
5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 
base pairs in length, and the like, (including all intermediate lengths) are contemplated 
to be usefid in many implementations of this invention. 

When comparing polynucleotide sequences, two sequences are said to be 

20 "identical" if the sequence of nucleotides in the two sequences is the same when aligned 
for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
window" as used herein, refers to a segment of at least about 20 contiguous positions, 

25 usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 

30 Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
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model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
5 vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11A7; Robinson, 
E.D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) MoL Biol Evol 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 

10 Lipman, D.J. (1983) Proc. Natl Acad, Sci. USA 50:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add APL 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
MoL BioL 48:443, by the search for similarity methods of Pearson and Lipman (1988) 

15 Proc Natl Acad Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for detennining 
20 percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) NucL Acids Res. 25:3389-3402 
and Altschul et al. (1990) J. MoL Biol. 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides of the invention. Software for performing 
25 BLAST analyses is publicly available through the National Center for Biotechnology 
Information. In one illustrative example, cumulative scores can be calculated using, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0) and N (penalty score for mismatching residues; always <0). Extension of 
the word hits in each direction are halted when: the cumulative alignment score falls off 
30 by the quantity X from its maximum achieved value; the cumulative score goes to zero 
or below, due to the accumulation of one or more negative-scoring residue alignments; 
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or the end of either sequence is reached. The BLAST algorithm parameters W, T and X 
determine the sensitivity and speed of the alignment. The BLASTN program (for 
nucleotide sequences) uses as defaults a wordlength (W) of 11, and expectation (E) of 
10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl 
5 Acad. Set USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 and 
a comparison of both strands. 

Preferably, the "percentage of sequence identity" is determined by 
comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide sequence in the comparison 

10 window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 
to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by determining the number of positions at which the identical 
nucleic acid bases occurs in both sequences to yield the number of matched positions, 

15 dividing the number of matched positions by the total number of positions in the 
reference sequence (z.e., the window size) and multiplying the results by 100 to yield the 
percentage of sequence identity. 

It will be appreciated by those of ordinary skill in the art that, as a result 
of the degeneracy of the genetic code, there are many nucleotide sequences that encode 

20 a polypeptide as described herein. Some of these polynucleotides bear minimal 
homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides 
that vary due to differences in codon usage are specifically contemplated by the present 
invention. Further, alleles of the genes comprising the polynucleotide sequences 
provided herein are within the scope of the present invention. Alleles are endogenous 

25 genes that are altered as a result of one or more mutations, such as deletions, additions 
and/or substitutions of nucleotides. The resulting mKNA and protein may, but need not, 
have an altered structure or function. Alleles may be identified using standard 
techniques (such as hybridization, amplification and/or database sequence comparison). 

Therefore, in another embodiment of the invention, a mutagenesis 

30 approach, such as site-specific mutagenesis, is employed for the preparation of 
immunogenic variants and/or derivatives of the polypeptides described herein. By this 
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approach, specific modifications in a polypeptide sequence can be made through 
mutagenesis of the underlying polynucleotides that encode them. These techniques 
provides a straightforward approach to prepare and test sequence variants, for example, 
incorporating one or more of Hie foregoing considerations, by introducing one or more 
5 nucleotide sequence changes into the polynucleotide. 

Site-specific mutagenesis allows the production of mutants through the 
use of specific oligonucleotide sequences which encode the DNA sequence of the 
desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a 
primer sequence of sufficient size and sequence complexity to form a stable duplex on 

10 both sides of the deletion junction being traversed. Mutations may be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise 
change the properties of the polynucleotide itself, and/or alter the properties, activity, 
composition, stability, or primary sequence of the encoded polypeptide. 

In certain embodiments of the present invention, the inventors 

15 contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or 
more properties of the encoded polypeptide, such as the immunogenicity of a 
polypeptide vaccine. The techniques of site-specific mutagenesis are well-known in the 
art, and are widely used to create variants of both polypeptides and polynucleotides. For 
example, site-specific mutagenesis is often used to alter a specific portion of a DNA 

20 molecule. In such embodiments, a primer comprising typically about 14 to about 25 
nucleotides or so in length is employed, with about 5 to about 10 residues on both sides 
of the junction of the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific 
mutagenesis techniques have often employed a phage vector that exists in both a single 

25 stranded and double stranded form. Typical vectors useful in site-directed mutagenesis 
include vectors such as the Ml 3 phage. These phage are readily commercially-available 
and their use is generally well-known to those skilled in the art. Double-stranded 
plasmids are also routinely employed in site directed mutagenesis that eliminates the 
step of transferring the gene of interest from a plasmid to a phage. 

30 In general, site-directed mutagenesis in accordance herewith is 

performed by first obtaining a single-stranded vector or melting apart of two strands of a 
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double-stranded vector that includes within its sequence a DNA sequence that encodes 
the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is 
prepared, generally synthetically. This primer is then annealed with the single-stranded 
vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I 
5 Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is 
then used to transform appropriate cells, such as E coli cells, and clones are selected 
which include recombinant vectors bearing the mutated sequence arrangement. 

10 The preparation of sequence variants of the selected peptide-encoding 

DNA segments using site-directed mutagenesis provides a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in 
which sequence variants of peptides and the DNA sequences encoding them may be 
obtained. For example, recombinant vectors encoding the desired peptide sequence 

15 may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachings of Maloy et al, 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and 
Maniatis et al, 1982, each incorporated herein by reference, for that purpose. 

As used herein, the term "oligonucleotide directed mutagenesis 

20 procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a specific nucleic acid molecule 
relative to its initial concentration, or in an increase in the concentration of a detectable 
signal, such as amplification. " As used herein, the term "oligonucleotide directed 
mutagenesis procedure" is intended to refer to a process that involves the 

25 template-dependent extension of a primer molecule. The. term template dependent 
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the 
sequence of the newly synthesized strand of nucleic acid is dictated by the well-known 
rules of complementary base pairing (see, for example, Watson, 1987). Typically, 
vector mediated methodologies involve the introduction of the nucleic acid fragment 

30 into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of 
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the amplified nucleic acid fragment. Examples of such methodologies are provided by 
U. S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

In another approach for the production of polypeptide variants of the 
present invention, recursive sequence recombination, as described in U.S. Patent No. 
5 5,837,458, may be employed. In this approach, iterative cycles of recombination and 
screening or selection are performed to "evolve" individual polynucleotide variants of 
the invention having, for example, enhanced immunogenic activity. 

In other embodiments of the present invention, the polynucleotide 
sequences provided herein can be advantageously used as probes or primers for nucleic 

10 acid hybridization. As such, it is contemplated that nucleic acid segments that comprise 
a sequence region of at least about 15 nucleotide long contiguous sequence that has the 
same sequence as, or is complementary to, a 15 nucleotide long contiguous sequence 
disclosed herein will find particular utility. Longer contiguous identical or 
complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 

15 (including all intermediate lengths) and even up to full length sequences will also be of 
use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to a 
sequence of interest will enable them to be of use in detecting the presence of 
complementary sequences in a given sample. However, other uses are also envisioned, 

20 such as the use of the sequence information for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Polynucleotide molecules having sequence regions consisting of 
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides 
or so (including intermediate lengths as well), identical or complementary to a 

25 polynucleotide sequence disclosed herein, are particularly contemplated as hybridization 
probes for use in, e.g., Southern and Northern blotting. This would allow a gene 
product, or fragment thereof, to be analyzed, both in diverse cell types and also in 
various bacterial cells. The total size of fragment, as well as the size of the 
complementary stretch(es), will ultimately depend on the intended use or application of 

30 the particular nucleic acid segment Smaller fragments will generally find use in 
hybridization embodiments, wherein the length of the contiguous complementary region 
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may be varied, such as between about 15 and about 100 nucleotides, but larger 
contiguous complementarity stretches may be used, according to the length 
complementary sequences one wishes to detect. 

The use of a hybridization probe of about 15-25 nucleotides in length 
5 allows the formation of a duplex molecule that is both stable and selective. Molecules 
having contiguous complementary sequences over stretches greater than 15 bases in 
length are generally preferred, though, in order to increase stability and selectivity of the 
hybrid, and thereby improve the quality and degree of specific hybrid molecules 
obtained. One will generally prefer to design nucleic acid molecules having gene- 
10 complementary stretches of 15 to 25 contiguous nucleotides, or even longer where 
desired. 

Hybridization probes may be selected from any portion of any of the 
sequences disclosed herein. All that is required is to review the sequences set forth 
herein, or to any continuous portion of the sequences, from about 15-25 nucleotides in 
1 5 length up to and including the full length sequence, that one wishes to utilize as a probe 
or primer. The choice of probe and primer sequences may be governed by various 
factors. For example, one may wish to employ primers from towards the termini of the 
total sequence. 

Small polynucleotide segments or fragments may be readily prepared by, 
20 for example, directly synthesizing the fragment by chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. Also, fragments may be 
obtained by application of nucleic acid reproduction technology, such as the PCR™ 
technology of U. S. Patent 4,683,202 (incorporated herein by reference), by introducing 
selected sequences into recombinant vectors for recombinant production, and by other 
25 recombinant DNA techniques generally known to those of skill in the art of molecular 
biology. 

The nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules with complementary stretches of the entire gene or 
gene fragments of interest Depending on the application envisioned, one will typically 
30 desire to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. For applications requiring high selectivity, 
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one will typically desire to employ relatively stringent conditions to form the hybrids, 
e.g., one will select relatively low salt and/or high temperature conditions, such as 
provided by a salt concentration of from about 0.02 M to about 0.15 M salt at 
temperatures of from about 50°C to about 70°C. Such selective conditions tolerate 
5 little, if any, mismatch between the probe and the template or target strand, and would 
be particularly suitable for isolating related sequences. 

Of course, for some applications, for example, where one desires to 
prepare mutants employing a mutant primer strand hybridized to an underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 

10 needed in order to allow formation of the heteroduplex. In these circumstances, one 
may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M 
salt, at temperatures ranging from about 20°C to about 55°C. Cross-hybridizing species 
can thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 

15 more stringent by the addition of increasing amounts of fonnamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. Thus, 
hybridization conditions can be readily manipulated, and thus will generally be a 
method of choice depending on the desired results. 

According to another embodiment of the present invention, 

20 polynucleotide compositions comprising antisense oligonucleotides are provided. 
Antisense oligonucleotides have been demonstrated to be effective and targeted 
inhibitors of protein synthesis, and, consequently, provide a therapeutic approach by 
which a disease can be treated by inhibiting the synthesis of proteins that contribute to 
the disease. The efficacy of antisense oligonucleotides for inhibiting protein synthesis 

25 is well established. For example, the synthesis of polygalactauronase and the muscarine 
type 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to their 
respective mRNA sequences (U. S. Patent 5,739,119 and U. S. Patent 5,759,829). 
Further, examples of antisense inhibition have been demonstrated with the nuclear 
protein cyclin, the multiple drug resistance gene (MDG1), ICAM-1, E-selectin, STK-1, 

30 striatal GABA A receptor and human EGF (Jaskulski et al, Science. 1988 Jun 
10;240(4858):1544-6; Vasanthakumar and Ahmed, Cancer Commun. 1989;1(4):225- 
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32; Peris et aU Brain Res Mol Brain Res. 1998 Jun 15;57(2):3 10-20; U. S. Patent 
5,801,154; U.S. Patent 5,789,573; U. S. Patent 5,718,709 and U.S. Patent 5,610,288). 
Antisense constructs have also been described that inhibit and can be used to treat a 
variety of abnormal cellular proliferations, e.g. cancer (U. S. Patent 5,747,470; U. S. 
5 Patent 5,591,317 and U. S. Patent 5,783,683). 

Therefore, in certain embodiments, the present invention provides 
oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
capable of specifically binding to polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA 

10 or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or 
derivatives thereof. In a third embodiment, the oligonucleotides are modified DNAs 
comprising a phosphorothioated modified backbone. In a fourth embodiment, the 
oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 
each case, preferred compositions comprise a sequence region that is complementary, 

15 and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more portions of polynucleotides disclosed herein. 
Selection of antisense compositions specific for a given gene sequence is based upon 
analysis of the chosen target sequence and determination of secondary structure, T m , 
binding energy, and relative stability. Antisense compositions may be selected based 

20 upon their relative inability to form dimers, hairpins, or other secondary structures that 
would reduce or prohibit specific binding to the target mRNA in a host cell. Highly 
preferred target regions of the mRNA, are those which are at or near the AUG 
translation initiation codon, and those sequences which are substantially complementary 
to 5' regions of the mRNA. These secondary structure analyses and target site selection 

25 considerations can be performed, for example, using v.4 of the OLIGO primer analysis 
software and/or the BLASTN 2.0.5 algorithm software (Altschul et al s Nucleic Acids 
Res. 1997, 25(17):3389-402). 

The use of an antisense delivery method employing a short peptide 
vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a 

30 hydrophobic domain derived from the fusion sequence of HTV gp41 and a hydrophilic 
domain from the nuclear localization sequence of SV40 T-antigen (Morris et aL 9 
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Nucleic Acids Res. 1997 Jul 15;25(14):2730-6). It has been demonstrated that several 
molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered 
into cultured mammalian cells in less than 1 hour with relatively high efficiency (90%). 
Further, the interaction with MPG strongly increases both the stability of the 
5 oligonucleotide to nuclease and the ability to cross the plasma membrane. 

According to another embodiment of the invention, the polynucleotide 
compositions described herein are used in the design and preparation of ribozyme 
molecules for inhibiting expression of the tumor polypeptides and proteins of the 
present invention in tumor cells. Ribozymes are RNA-protein complexes that cleave 

10 nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that 
possess endonuclease activity (Kim and Cech, Proc Natl Acad Sci USA. 1987 
Dec;84(24):8788-92; Forster and Symons, Cell. 1987 Apr 24;49(2):211-20). For 
example, a large number of ribozymes accelerate phosphoester transfer reactions with a 
high degree of specificity, often cleaving only one of several phosphoesters in an 

15 oligonucleotide substrate (Cech et al 9 Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 
Westhof, J Mol Biol. 1990 Dec 5;216(3):585-610; Reinhold-Hurek and Shub, Nature. 
1992 May 14;357(6374):173-6). This specificity has been attributed to the requirement 
that the substrate bind via specific base-pairing interactions to the internal guide 
sequence ("IGS") of the ribozyme prior to chemical reaction. 

20 Six basic varieties of naturally-occurring enzymatic RNAs are known 

presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and 
thus can cleave other RNA molecules) under physiological conditions. In general, 
enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs 
through the target binding portion of a enzymatic nucleic acid which is held in close 

25 proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 
Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the correct site, acts enzymatically to 
cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 

30 cleaved its RNA target, it is released from that RNA to search for another target and can 
repeatedly bind and cleave new targets. 
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The enzymatic nature of a ribozyme is advantageous over many 
technologies, such as antisense technology (where a nucleic acid molecule simply binds 
to a nucleic acid target to block its translation) since the concentration of ribozyme 
necessary to affect a therapeutic treatment is lower than that of an antisense 
5 oligonucleotide. This advantage reflects the ability of the ribozyme to act 
enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of 
target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 
of inhibition depending not only on the base pairing mechanism of binding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 

10 substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
ribozyme. Similar mismatches in antisense molecules do not prevent their action 
(Woolf etal, Proc Natl Acad Sci U S A. 1992 Aug 15;89(16):7305-9). Thus, the 
specificity of action of a ribozyme is greater than that of an antisense oligonucleotide 
binding the same RNA site. 

15 The enzymatic nucleic acid molecule may be formed in a hammerhead, 

hairpin, a hepatitis 5 virus, group I intron or RNaseP RNA (in association with an RNA 
guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are 
described by Rossi et al Nucleic Acids Res. 1992 Sep 1 1;20(17):4559-65. Examples of 
haiipin motifs are described by Hampel etal (Eur. Pat. Appl. Publ. No. EP 0360257), 

20 Hampel and Tritz, Biochemistry 1989 Jun 13;28(12):4929-33; Hampel etal, Nucleic 
Acids Res. 1990 Jan 25;18(2):299-304 and U. S. Patent 5,631,359. An example of the 
hepatitis 8 virus motif is described by Perrotta and Been, Biochemistry. 1992 Dec 
1;3 1(47): 11 843-52; an example of the RNaseP motif is described by Guerrier-Takada 
etal, Cell. 1983 Dec;35(3 Pt 2):849-57; Neurospora VS RNA ribozyme motif is 

25 described by Collins (Saville and Collins, Cell. 1990 May 18;61(4):685-96; Saville and 
Collins, Proc Natl Acad Sci USA. 1991 Oct l;88(19):8826-30; Collins and Olive, 
Biochemistry. 1993 Mar 23;32(1 1):2795-9); and an example of the Group I intron is 
described in (U. S. Patent 4,987,071). All that is important in an enzymatic nucleic acid 
molecule of this invention is that it has a specific substrate binding site which is 

30 complementary to one or more of the target gene RNA regions, and that it have 
nucleotide sequences within or surrounding that substrate binding site which impart an 
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RNA cleaving activity to the molecule. Thus the ribozyme constructs need not be 
limited to specific motifs mentioned herein. 

Ribozymes may be designed as described in Int. Pat Appl. Publ. No. 
WO 93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595, each specifically 
5 incorporated herein by reference) and synthesized to be tested in vitro and in vivo, as 
described. Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in the art will recognize that equivalent RNA targets in 
other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the 

10 ribozyme binding arms, or chemically synthesizing ribozymes with modifications that 
prevent their degradation by serum ribonucleases (see e.g., Int. Pat Appl. Publ. No. WO 
92/07065; Int. Pat Appl. Publ. No. WO 93/15187; Int. Pat Appl. Publ. No. WO 
91/03162; Eur. Pat Appl. Publ. No. 92110298.4; U. S. Patent 5,334,711; and Int. Pat 
Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can 

15 be made to the sugar moieties of enzymatic RNA molecules), modifications which 
enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements. 

Sullivan etal (Int. Pat. Appl. Publ. No. WO 94/02595) describes the 
general methods for delivery of enzymatic RNA molecules. Ribozymes may be 

20 administered to cells by a variety of methods known to those familiar to the art, 
including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be 
directly delivered ex vivo to cells or tissues with or without the aforementioned vehicles. 

25 Alternatively, the RNA/vehicle combination may be locally delivered by direct 
inhalation, by direct injection or by use of a catheter, infusion pump or stint. Other 
routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions 

30 of ribozyme delivery and administration are provided in Int. Pat. Appl. Publ. No. WO 
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94/02595 and Int. Pat. Appl. Publ. No. WO 93/23569, each specifically incorporated 
herein by reference. 

Another means of accumulating high concentrations of a ribozyme(s) 
within cells is to incorporate the ribozyme-encoding sequences into a DNA expression 
5 vector. Transcription of the ribozyme sequences are driven from a promoter for 
eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase 
HI (pol HI). Transcripts from pol II or pol in promoters will be expressed at high levels 
in all cells; the levels of a given pol II promoter in a given cell type will depend on the 
nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 

10 Prokaryotic RNA polymerase promoters may also be used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate cells. Ribozymes 
expressed from such promoters have been shown to function in mammalian cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 
mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 

15 vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as 
retroviral, semliki forest virus, sindbis virus vectors). 

In another embodiment of the invention, peptide nucleic acids (PNAs) 
compositions are provided. PNA is a DNA mimic in which the nucleobases are 
attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 

20 Dev. 1997 7(4) 431-37). PNA is able to be utilized in a number methods that 
traditionally have used RNA or DNA. Often PNA sequences perform better in 
techniques than the corresponding RNA or DNA sequences and have utilities that are 
not inherent to RNA or DNA. A review of PNA including methods of making, 
characteristics of, and methods of using, is provided by Corey (Trends Biotechnol 1997 

25 Jun;15(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 
that are complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which 
such PNA compositions have been administered. 

30 PNAs have 2-aminoethyl-glycine linkages replacing the normal 

phosphodiester backbone of DNA (Nielsen et al, Science 1991 Dec 6;254(5037):1497- 
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500; Hanvey et al, Science. 1992 Nov 27;258(5087):1481-5; Hyrup and Nielsen, 
Bioorg Med Chem. 1996 Jan;4(l):5-23). This chemistry has three important 
consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs 
are neutral molecules; secondly, PNAs are achiral, which avoids the need to develop a 
5 stereoselective synthesis; and thirdly, PNA synthesis uses standard Boc or Fmoc 
protocols for solid-phase peptide synthesis, although other methods, including a 
modified Merrifield method, have been used. 

PNA monomers or ready-made oligomers are commercially available 
from PerSeptive Biosystems (Framingjiam, MA). PNA syntheses by either Boc or 

10 Fmoc protocols are straightforward using manual or automated protocols (Norton et al, 
Bioorg Med Chem. 1995 Apr;3(4):437-45). The manual protocol lends itself to the 
production of chemically modified PNAs or the simultaneous synthesis of families of 
closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will 

15 depend on the properties of the chosen sequence. For example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
can lead to deletions of one or more residues in the product. In expectation of this 
difficulty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefficiently. This should be followed 

20 by the purification of PNAs by reverse-phase high-pressure liquid chromatography, 
providing yields and purity of product similar to those observed during the synthesis of 
peptides. 

Modifications of PNAs for a given application may be accomplished by 
coupling amino acids during solid-phase synthesis or by attaching compounds that 

25 contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs 
can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
ease with which PNAs can be modified facilitates optimization for better solubility or 
for specific functional requirements. Once synthesized, the identity of PNAs and their 
derivatives can be confirmed by mass spectrometry. Several studies have made and 

30 utilized modifications of PNAs (for example, Norton et al, Bioorg Med Chem. 1995 
Apr,3(4):437-45; Petersen et al, J Pept Sci. 1995 May-Jun; 1(3): 175-83; Orum et al, 
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Biotechniques. 1995 Sep;19(3):472-80; Footer et al, Biochemistry. 1996 Aug 
20;35(33):10673-9; Griffith et al, Nucleic Acids Res. 1995 Aug ll;23(15):3003-8; 
Pardridge et al, Proc Natl Acad Sci USA. 1995 Jun 6;92(12):5592-6; Boffa et al, 
Proc Natl Acad Sci USA. 1995 Mar 14;92(6):1901-5; Gambacorti-Passerini et al, 
5 Blood. 1996 Aug 15;88(4):141 1-7; Annitage et al, Proc Natl Acad Sci USA. 1997 
Nov 11;94(23): 12320-5; Seeger et al, Biotechniques. 1997 Sep;23(3):512-7). U.S. 
Patent No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 
therapeutics. 

10 Methods of characterizing the antisense binding properties of PNAs are 

discussed in Rose (Anal Chem. 1993 Dec 15;65(24):3545-9) and Jensen et al 
(Biochemistry. 1997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
determine binding of PNAs to their complementary oligonucleotide, measuring the 
relative binding kinetics and stoichiometry. Similar types of measurements were made 

15 by Jensen et al using BIAcore™ technology. 

Other applications of PNAs that have been described and will be 
apparent to the skilled artisan include use in DNA strand invasion, antisense inhibition, 
mutational analysis, enhancers of transcription, nucleic acid purification, isolation of 
transcriptionally active genes, blocking of transcription factor binding, genome 

20 cleavage, biosensors, in situ hybridization, and the like. 

Polynucleotide Identification, Characterization and Expression 

Polynucleotides compositions of the present invention may be identified, 
prepared and/or manipulated using any of a variety of well established techniques (see 
generally, Sambrook et al,, Molecular Cloning: A Laboratory Manual, Cold Spring 

25 Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For 
example, a polynucleotide may be identified, as described in more detail below, by 
screening a microarray of cDNAs for tumor-associated expression (ie. f expression that 
is at least two fold greater in a tumor than in noimal tissue, as determined using a * 
representative assay provided herein). Such screens may be performed, for example, 

30 using the microarray technology of Affymetrix, Inc. (Santa Clara, CA) according to the 
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manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl 
Acad Set USA PJ:10614-10619, 1996 and Heller et al., Proc. Natl Acad Sci USA 
P4:2150-2155, 1997). Alternatively, polynucleotides may be amplified from cDNA 
prepared from cells expressing the proteins described herein, such as tumor cells. 
5 Many template dependent processes are available to amplify a target 

sequences of interest present in a sample. One of the best known amplification methods 
is the polymerase chain reaction (PCR™) which is described in detail in U.S. Patent 
Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by 
reference in its entirety. Briefly, in PCR™, two primer sequences are prepared which 

10 are complementary to regions on opposite complementary strands of the target 
sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase (e.g., Tag polymerase). If the target sequence is present 
in a sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by adding on nucleotides. By raising 

15 and lowering the temperature of the reaction mixture, the extended primers will 
dissociate from the target to form reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
transcription and PCR™ amplification procedure may be performed in order to quantify 
the amount of mRNA amplified. Polymerase chain reaction methodologies are well 

20 known in the art. 

Any of a number of other template dependent processes, many of which 
are variations of the PCR ™ amplification technique, are readily known and available in 
the art. Illustratively, some such methods include the ligase chain reaction (referred to 
as LCR), described, for example, in Eur. Pat. Appl. Publ. No. 320,308 and U.S. Patent 

25 No. 4,883,750; Qbeta Replicase, described in PCT Intl. Pat Appl. Publ. No. 
PCT/US87/O0880; Strand Displacement Amplification (SDA) and Repair Chain 
Reaction (RCR). Still other amplification methods are described in Great Britain Pat. 
Appl. No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025. Other 
nucleic acid amplification procedures include transcription-based amplification systems 

30 (TAS) (PCT Intl. Pat. Appl. Publ. No. WO 88/10315), including nucleic acid sequence 
based amplification (NASBA) and 3SR. Eur. Pat. Appl. Publ. No. 329,822 describes a 
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nucleic acid amplification process involving cyclically synthesizing single-stranded 
RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA). PCT Intl. Pat Appl. 
PubL No. WO 89/06700 describes a nucleic acid sequence amplification scheme based 
on the hybridization of a promoter/primer sequence to a target single-stranded DNA 
5 ("ssDNA") followed by transcription of many RNA copies of the sequence. Other 
amplification methods such as "RACE" (Frohman, 1990), and "one-sided PCR" (Ohara, 
1989) are also well-known to those of skill in the art. 

An amplified portion of a polynucleotide of the present invention may be 
used to isolate a full length gene from a suitable library (e.g., a tumor cDNA library) 

10 using well known techniques. Within such techniques, a library (cDNA or genomic) is 
screened using one or more polynucleotide probes or primers suitable for amplification. 
Preferably, a library is size-selected to include larger molecules. Random primed 
libraries may also be preferred for identifying 5' and upstream regions of genes. 
Genomic libraries are preferred for obtaining introns and extending 5' sequences. 

15 For hybridization techniques, a partial sequence may be labeled (e.g., by 

nick-translation or end-labeling with 32 P) using well known techniques. A bacterial or 
bacteriophage library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
(see Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

20 Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 
be analyzed to determine the amount of additional sequence by, for example, PCR using 
a primer from the partial sequence and a primer from the vector. Restriction maps and 
partial sequences may be generated to identify one or more overlapping clones. The 

25 complete sequence may then be determined using standard techniques, which may 
involve generating a series of deletion clones. The resulting overlapping sequences can 
then assembled into a single contiguous sequence. A full length cDNA molecule can be 
generated by ligating suitable fragments, using well known techniques. 

Alternatively, amplification techniques, such as those described above, 

30 can be useful for obtaining a full length coding sequence from a partial cDNA sequence. 
One such amplification technique is inverse PCR (see Triglia et al., Nucl Acids Res. 
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16: 8 186, 1988), which uses restriction enzymes to generate a fragment in the known 
region of the gene. The fragment is then circularized by intramolecular ligation and 
used as a template for PCR with divergent primers derived from the known region. 
Within an alternative approach, sequences adjacent to a partial sequence may be 
5 retrieved by amplification with a primer to a linker sequence and a primer specific to a 
known region. The amplified sequences are typically subjected to a second round of 
amplification with the same linker primer and a second primer specific to the known 
region. A variation on this procedure, which employs two primers that initiate 
extension in opposite directions from the known sequence, is described in WO 

10 96/38591. Another such technique is known as "rapid amplification of cDNA ends" or 
RACE. This technique involves the use of an internal primer and an external primer, 
which hybridizes to a polyA region or vector sequence, to identify sequences that are 5' 
and 3' of a known sequence. Additional techniques include capture PCR (Lagerstrom et 
al., PCR Methods Applic. 7:111-19, 1991) and walking PCR (Parker et al., Nucl Acids. 

15 Res. 7P:3055-60, 1991). Other methods employing amplification may also be employed 
to obtain a full length cDNA sequence. 

In certain instances, it is possible to obtain a full length cDNA sequence 
by analysis of sequences provided in an expressed sequence tag (EST) database, such as 
that available from GenBank. Searches for overlapping ESTs may generally be 

20 performed using well known programs (e.g., NCBI BLAST searches), and such ESTs 
may be used to generate a contiguous full length sequence. Full length DNA sequences 
may also be obtained by analysis of genomic fragments. 

In other embodiments of the invention, polynucleotide sequences or 
fragments thereof which encode polypeptides of the invention, or fusion proteins or 

25 functional equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences that encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and these sequences may 
be used to clone and express a given polypeptide. 

30 As will be understood by those of skill in the art, it may be advantageous 

in some instances to produce polypeptide-encoding nucleotide sequences possessing 
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non-naturaUy occurring codons. For example, codons preferred by a particular 
prokaiyotic or eukaryotic host can be selected to increase the rate of protein expression 
or to produce a recombinant RNA transcript having desirable properties, such as a half- 
life which is longer than that of a transcript generated from the naturally occurring 
5 sequence. 

Moreover, the polynucleotide sequences of the present invention can be 
engineered using methods generally known in the art in order to alter polypeptide 
encoding sequences for a variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product. For 

10 example, DNA shuffling by random fragmentation and PCR reassembly of gene 
fragments and synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

15 In another embodiment of the invention, natural, modified, or 

recombinant nucleic acid sequences may be ligated to a heterologous sequence to 
encode a fusion protein. For example, to screen peptide libraries for inhibitors of 
polypeptide activity, it may be useful to encode a chimeric protein that can be 
recognized by a commercially available antibody. A fusion protein may also be 

20 engineered to contain a cleavage site located between the polypeptide-encoding 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. 

Sequences encoding a desired polypeptide may be synthesized, in whole 
or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. 

25 (1980) Nucl Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) NucL Acids Res. 
Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical 
methods, to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
For example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be 

30 achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer, Palo 
Alto,CA). 
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A newly synthesized peptide may be substantially purified by preparative 
high performance liquid chromatography (e.g. , Creighton, T. (1 983) Proteins, Structures 
and Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable 
techniques available in the art. The composition of the synthetic peptides may be 
5 confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
may be altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

In order to express a desired polypeptide, the nucleotide sequences 

10 encoding the polypeptide, or functional equivalents, may be inserted into appropriate 
expression vector* i.e. 9 a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
known to those skilled in the art may be used to construct expression vectors containing 
sequences encoding a polypeptide of interest and appropriate transcriptional and 

15 translational control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. Such techniques 
are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et 
al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. 

20 N.Y. 

A variety of expression vector/host systems may be utilized to contain 
and express polynucleotide sequences. These include, but are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, 
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 

25 insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell 
systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. 

The "control elements 1 ' or "regulatory sequences" present in an 

30 expression vector are those non-translated regions of the vector-enhancers, promoters, 
5 f and 3 r untranslated regions-which interact with host cellular proteins to carry out 
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transcription and translation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. 
For example, when cloning in bacterial systems, inducible promoters such as the hybrid 
5 lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or 
PSPORT1 plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 
mammalian cell systems, promoters from mammalian genes or from mammalian viruses 
are generally preferred. If it is necessary to generate a cell line that contains multiple 
copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be 

10 advantageously used with an appropriate selectable marker. 

In bacterial systems, any of a number of expression vectors may be 
selected depending upon the use intended for the expressed polypeptide. For example, 
when large quantities are needed, for example for the induction of antibodies, vectors 
which direct high level expression of fusion proteins that are readily purified may be 

15 used. Such vectors include, but are not limited to, the multifunctional E. colt cloning 
and expression vectors such as BLUESCRIPT (Stratagene), in which the sequence 
encoding the polypeptide of interest may be ligated into the vector in frame with 
sequences for the ammo-terminal Met and the subsequent 7 residues of .beta.- 
galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. 

20 M. Schuster (1989) J. Biol Chem. 2^:5503-5509); and the like. pGEX Vectors ' 
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion 
proteins with glutathione S-transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. Proteins made in such 

25 systems may be designed to include heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 
will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may 
30 be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods 
Enzymol 153:516-544. 
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In cases where plant expression vectors are used, the expression of 
sequences encoding polypeptides may be driven by any of a number of promoters. For 
example, viral promoters such as the 35S and 19S promoters of CaMV may be used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. 
5 (1987) EMBO J. 5:307-31 1. Alternatively, plant promoters such as the small subunit of 
RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 
3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) 
Results Probl Cell Differ. 77:85-105). These constructs can be introduced into plant 
cells by direct DNA transformation or pathogen-mediated transfection. Such techniques 

10 are described in a number of generally available reviews (see, for example, Hobbs, S. or 
Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 
Hill, New York, N.Y.; pp. 191-196). 

An insect system may also be used to express a polypeptide of interest. 
For example, in one such system, Autographa californica nuclear polyhedrosis virus 

15 (AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or 
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 
non-essential region of the virus, such as the polyhedrin gene, and placed under control 
of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence 
will render the polyhedrin gene inactive and produce recombinant virus lacking coat 

20 protein. The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which the polypeptide of interest may be expressed 
(Engelhard, E. K. et al. (1994) Proc. Natl. Acad Sci. 91 :3224-3227). 

In mammalian host cells, a number of viral-based expression systems are 
generally available. For example, in cases where an adenovirus is used as an expression 

25 vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used 
to obtain a viable virus which is capable of expressing the polypeptide in infected host 
cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad Sci. Si:3655-3659). In addition, 

30 transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host ceils. 
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Specific initiation signals may also be used to achieve more efficient 
translation of sequences encoding a polypeptide of interest Such signals include the 
ATG initiation codon and adjacent sequences. In cases where sequences encoding the 
polypeptide, its initiation codon, and upstream sequences are inserted into the 
5 appropriate expression vector, no additional transcriptional or translational control 
signals may be needed. However, in cases where only coding sequence, or a portion 
thereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should be provided. Furthermore, the initiation codon should be in the correct 
reading frame to ensure translation of the entire insert. Exogenous translational 

10 elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers which are 
appropriate for the particular cell system which is used, such as those described in the 
literature (Scharf, D. et al. (1994) Results Probl Cell Differ. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to modulate 

15 the expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylatioa, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 
used to facilitate correct insertion, folding and/or function. Different host cells such as 

20 CHO, COS, HeLa, MDCK, HEK293, and WI38, which have specific cellular machinery 
and characteristic mechanisms for such post-translational activities, may be chosen to 
ensure the correct modification and processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is generally preferred. For example, cell lines which stably express a 

25 polynucleotide of interest may be transformed using expression vectors which may 
contain viral origins of replication and/or endogenous expression elements and a 
selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for 1-2 days in an enriched media before 
they are switched to selective media. The purpose of the selectable marker is to confer 

30 resistance to selection, and its presence allows growth and recovery of cells which 
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successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be proliferated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus thymidine 
5 kinase (Wigler, M. et al. (1977) Cell 77:223-32) and adenine phosphoribosyltransferase 
(Lowy, L et al. (1990) Cell 22:817-23) genes which can be employed in tLsup.- or 
aprtsup.- cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can 
be used as the basis for selection; for example, dhfr which confers resistance to 
methotrexate (Wigler, M. et al. (1980) Proc. Natl Acad Sci 77:3567-70); npt, which 

10 confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et 
al (1981) J. Mol. Biol. 750:1-14); and als or pat, which confer resistance to 
chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). 
Additional selectable genes have been described, for example, tipB, which allows cells 
to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in 

15 place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl Acad Sci. 
55:8047-51). The use of visible markers has gained popularity with such markers as 
anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and its substrate 
luciferin, being widely used riot only to identify transformants, but also to quantify the 
amount of transient or stable protein expression attributable to a specific vector system 

20 (Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-131). 

Although the presence/absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need to be 
confirmed. For example, if the sequence encoding a polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by the 

25 absence of marker gene function. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-encoding sequence under the control of a single promoter. 
Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

Alternatively, host cells that contain and express a desired 

30 polynucleotide sequence may be identified by a variety of procedures known to those of 
skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA- 
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RNA hybridizations and protein bioassay or immunoassay techniques which include, 
for example, membrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein. 

A variety of protocols for detecting and measuring the expression of 
5 polynucleotide-encoded products, using either polyclonal or monoclonal antibodies 
specific for the product are known in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on a given polypeptide may be 

10 preferred for some applications, but a competitive binding assay may also be employed. 
These and other assays are described, among other places, in Hampton, R. et al. (1990; 
Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. 
E. et al. (1983; J. Exp. Med 755:1211-1216). 

A wide variety of labels and conjugation techniques are known by those 

15 skilled in the art and may be used in various nucleic acid and amino acid assays. Means 
for producing labeled hybridization or PCR probes for detecting sequences related to 
polynucleotides include oligolabeling, nick translation, end-labeling or PCR 
amplification using a labeled nucleotide. Alternatively, the sequences, or any portions 
thereof may be cloned into a vector for the production of an mRNA probe. Such vectors 

20 are known in the art, are commercially available, and may be used to synthesize RNA 
probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits. Suitable reporter molecules or labels, which may be used 
include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents 

25 as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with a polynucleotide sequence of interest may be 
cultured under conditions suitable for the expression and recovery of the protein from 
cell culture. The protein produced by a recombinant cell may be secreted or contained 
intracellular^ depending on the sequence and/or the vector used. As will be understood 

30 by those of skill in the art, expression vectors containing polynucleotides of the 
invention may be designed to contain signal sequences which direct secretion of the 
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encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other 
recombinant constructions may be used to join sequences encoding a polypeptide of 
interest to nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins. Such purification facilitating domains include, but are 
5 not limited to, metal chelating peptides such as histidine-tryptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp., Seattle, Wash.).. The inclusion of cleavable linker 
sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 

10 Calif.) between the purification domain and the encoded polypeptide may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 
protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine 
residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as 

15 described in Porath, J. et al. (1992, Prot Exp. Purif 3:263-281) while the enterokinase 
cleavage site provides a means for purifying the desired polypeptide from the fusion 
protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. 
etal. (1993; DNA Cell Biol. 72:441-453). 

In addition to recombinant production methods, polypeptides of the 

20 invention, and fragments thereof, may be produced by direct peptide synthesis using 
solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc. 55:2149-2154). Protein 
synthesis may be performed using manual techniques or by automation. Automated 
synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide 
Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically 

25 synthesized separately and combined using chemical methods to produce the full length 
molecule. 

Antibody Compositions. Fragments Thereof and Other Binding Agents 

According to another aspect, the present invention further provides 
binding agents, such as antibodies and antigen-binding fragments thereof, that exhibit 
30 immunological binding to a tumor polypeptide disclosed herein, or to a portion, variant 
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or derivative thereof An antibody, or antigen-binding fragment thereof, is said to 
"specifically bind," "immunogically bind," and/or is "immunologically reactive" to a 
polypeptide of the invention if it reacts at a detectable level (within, for example, an 
ELISA assay) with the polypeptide, and does not react detectably with unrelated 
5 polypeptides under similar conditions. 

Immunological binding, as used in this context, generally refers to the 
non-covalent interactions of the type which occur between an immunoglobulin 
molecule and an antigen for which the immunoglobulin is specific. The strength, or 
affinity of immunological binding interactions can be expressed in terms of the 

10 dissociation constant (Kd) of the interaction, wherein a smaller K<i represents a greater 
affinity. Immunological binding properties of selected polypeptides can be quantified 
using methods well known in the art. One such method entails measuring the rates of 
antigen-binding site/antigen complex formation and dissociation, wherein those rates 
depend on the concentrations of the complex partners, the affinity of the interaction, and 

15 on geometric parameters that equally influence the rate in both directions. Thus, both 
the "on rate constant" (Ko n ) and the "off rate constant" (Ko ff ) can be determined by 
calculation of the concentrations and the actual rates of association and dissociation. 
The ratio of Koff /Ko n enables cancellation of all parameters not related to affinity, and is 
thus equal to the dissociation constant Kd. See, generally, Davies et al. (1990) Annual 

20 Rev. Biochem. 59:439-473. 

An "antigen-binding site," or "binding portion" of an antibody refers to 
the part of the immunoglobulin molecule that participates in antigen binding. The 
antigen binding site is formed by amino acid residues of the N-terminal variable ("V") 
regions of the heavy ("H") and light ("L") chains. Three highly divergent stretches 

25 within the V regions of the heavy and light chains are referred to as "hypervariable 
regions" which are interposed between more conserved flanking stretches known as 
"framework regions," or "FRs". Thus the term "FR" refers to amino acid sequences 
which are naturally found between and adjacent to hypervariable regions in 
im m u noglobulins. In an antibody molecule, the three hypervariable regions of a light 

30 chain and the three hypervariable regions of a heavy chain are disposed relative to each 
other in three dimensional space to form an antigen-binding surface. The antigen- 
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binding surface is complementary to the three-dimensional surface of a bound antigen, 
and the three hypervariable regions of each of the heavy and light chains are referred to 
as "complementarity-determining regions," or "CDRs." 

Binding agents may be further capable of differentiating between patients 
5 with and without a cancer, such as lung cancer, using the representative assays provided 
herein. For example, antibodies or other binding agents that bind to a tumor protein 
will preferably generate a signal indicating the presence of a cancer in at least about 
20% of patients with the disease, more preferably at least about 30% of patients. 
Alternatively, or in addition, the antibody will generate a negative signal indicating the 

10 absence of the disease in at least about 90% of individuals without the cancer. To 
determine whether a binding agent satisfies this requirement, biological samples (e.g., 
blood, sera, sputum, urine and/or tumor biopsies) from patients with and without a 
cancer (as determined using standard clinical tests) may be assayed as described herein 
for the presence of polypeptides that bind to the binding agent Preferably, a statistically 

15 significant number of samples with and without the disease will be assayed. Each 
binding agent should satisfy the above criteria; however, those of ordinary skill in the 
art will recognize that binding agents may be used in combination to improve 
sensitivity. 

Any agent that satisfies the above requirements may be a binding agent. 

20 For example, a binding agent may be a ribosome, with or without a peptide component, 
an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an 
antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of 
a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In 

25 general, antibodies can be produced by cell culture techniques, including the generation 
of monoclonal antibodies as described herein, or via transfection of antibody genes into 
suitable bacterial or mammalian cell hosts, in order to allow for the production of 
recombinant antibodies. In one technique, an immunogen comprising the polypeptide is 
initially injected into any of a wide variety of mammals {e.g., mice, rats, rabbits, sheep 

30 or goats). In this step, the polypeptides of this invention may serve as the immunogen 
without modification. Alternatively, particularly for relatively short polypeptides, a 
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superior immune response may be elicited if the polypeptide is joined to a carrier 
protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen 
is injected into the animal host, preferably according to a predetermined schedule 
incorporating one or more booster immunizations, and the animals are bled periodically. 
5 Polyclonal antibodies specific for the polypeptide may then be purified from such 
antisera by, for example, affinity chromatography using the polypeptide coupled to a 
suitable solid support. 

Monoclonal antibodies specific for an antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur. 7. 

10 Immunol. 6:5 11-519, 1976, and improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the 
desired specificity (i.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 
described above. The spleen cells are then immortalized by, for example, fusion with a 

15 myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
animal. A variety of fusion techniques may be employed. For example, the spleen cells 
and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 

20 aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 
colonies of hybrids are observed. Single colonies are selected and their culture 
supernatants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supernatants of growing 

25 hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 

30 extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography step. 
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A number of therapeutically useful molecules are known in the art which 
comprise antigen-binding sites that are capable of exhibiting immunological binding 
properties of an antibody molecule. The proteolytic en2yme papain preferentially 
cleaves IgG molecules to yield several fragments, two of which (the "F(ab) M fragments) 
5 each comprise a covalent heterodimer that includes an intact antigen-binding site. The 
enzyme pepsin is able to cleave IgG molecules to provide several fragments, including 
the "Ffab^ 11 fragment which comprises both antigen-binding sites. An "Fv" fragment 
can be produced by preferential proteolytic cleavage of an IgM, and on rare occasions 
IgG or IgA immunoglobulin molecule. Fv fragments are, however, more commonly 

10 derived using recombinant techniques known in the art. The Fv fragment includes a 
non-covalent Vh::Vl heterodimer including an antigen-binding site which retains much 
of the antigen recognition and binding capabilities of the native antibody molecule. 
Inbar et aL (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) 
Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096. 

15 A single chain Fv ( M sFv' f ) polypeptide is a covalently linked V H ::V L 

heterodimer which is expressed from a gene fusion including Vh- and VL-encoding 
genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. 
USA 85(16):5879-5883. A number of methods have been described to discern chemical 
structures for converting the naturally aggregated-but chemically separated-light and 

20 heavy polypeptide chains from an antibody V region into an sFv molecule which will 
fold into a three dimensional structure substantially similar to the structure of an 
antigen-binding site. See, e.g., U.S. Pat. Nos. 5,091,513 and 5,132,405, to Huston et al.; 
and U.S. Pat. No. 4,946,778, to Ladner et al. 

Each of the above-described molecules includes a heavy chain and a 

25 light chain CDR set, respectively interposed between a heavy chain and a light chain FR 
set which provide support to the CDRS and define the spatial relationship of the CDRs 
relative to each other. As used herein, the term "CDR set" refers to the three 
hypervariable regions of a heavy or light chain V region. Proceeding from the N- 
terminus of a heavy or light chain, these regions are denoted as "CDR1," "CDR2," and 

30 "CDR3" respectively. An antigen-binding site, therefore, includes six CDRs, 
comprising the CDR set from each of a heavy and a light chain V region. A polypeptide 
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comprising a single CDR, (e.g., a CDR1, CDR2 or CDR3) is referred to herein as a 
"molecular recognition unit" Ciystallographic analysis of a number of antigen-antibody 
complexes has demonstrated that the amino acid residues of CDRs form extensive 
contact with bound antigen, wherein the most extensive antigen contact is with the 
5 heavy chain CDR3. Thus, the molecular recognition units are primarily responsible for 
the specificity of an antigen-binding site. 

As used herein, the term "FR set" refers to the four flanking amino acid 
sequences which frame the CDRs of a CDR set of a heavy or light chain V region. 
Some FR residues may contact bound antigen; however, FRs are primarily responsible 

10 for folding the V region into the antigen-binding site, particularly the FR residues 
directly adjacent to the CDRS. Within FRs, certain amino residues and certain structural 
features are very highly conserved. In this regard, all V region sequences contain an 
internal disulfide loop of around 90 amino acid residues. When the V regions fold into a 
binding-site, the CDRs are displayed as projecting loop motifs which form an antigen- 

15 binding surface. It is generally recognized that there are conserved structural regions of 
FRs which influence the folded shape of the CDR loops into certain "canonical" 
structures-regardless of the precise CDR amino acid sequence. Further, certain FR 
residues are known to participate in non-covalent interdomain contacts which stabilize 
the interaction of the antibody heavy and light chains. 

20 A number of "humanized" antibody molecules comprising an antigen- 

binding site derived from a non-human immunoglobulin have been described, including 
chimeric antibodies having rodent V regions and their associated CDRs fused to human 
constant domains (Winter et aL (1991) Nature 349:293-299; Lobuglio et al. (1989) 
Proc. Nat. Acad. Sci. USA 86:4220-4224; Shaw et al. (1987) J Immunol. 138:4534- 

25 4538; and Brown et al. (1987) Cancer Res. 47:3577-3583), rodent CDRs grafted into a 
human supporting FR prior to fusion with an appropriate human antibody constant 
domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 
239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent CDRs 
supported by recombinantly veneered rodent FRs (European Patent Publication No. 

30 519,596, published Dec. 23, 1992). These "humanized" molecules are designed to 
minimize unwanted immunological response toward rodent antihuman antibody 
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molecules which limits the duration and effectiveness of therapeutic applications of 
those moieties in human recipients. 

As used herein, the terms "veneered FRs" and "recombinantly veneered 
FRs" refer to the selective replacement of FR residues from, e.g., a rodent heavy or light 
5 chain V region, with human FR residues in order to provide a xenogeneic molecule 
comprising an antigen-binding site which retains substantially all of the native FR 
polypeptide folding structure. Veneering techniques are based on the understanding that 
the ligand binding characteristics of an antigen-binding site are determined primarily by 
the structure and relative disposition of the heavy and light chain CDR sets within the 

10 antigen-binding surface. Davies et aL (1990) Ann. Rev. Biochem. 59:439-473. Thus, 
antigen binding specificity can be preserved in a humanized antibody only wherein the 
CDR structures, their interaction with each other, and their interaction with the rest of 
the V region domains are carefully maintained. By using veneering techniques, exterior 
{e.g., solvent-accessible) FR residues which are readily encountered by the immune 

15 system are selectively replaced with human residues to provide a hybrid molecule that 
comprises either a weakly immunogenic, or substantially non-irnmunogenic veneered 
surface. 

The process of veneering makes use of the available sequence data for 
human antibody variable domains compiled by Kabat et al., in Sequences of Proteins of 

20 Immunological Interest, 4th ed., (U.S. Dept. of Health and Human Services, U.S. 
Government Printing Office, 1987), updates to the Kabat database, and other accessible 
U.S. and foreign databases (both nucleic acid and protein). Solvent accessibilities of V 
region amino acids can be deduced from the known three-dimensional structure for 
human and murine antibody fragments. There are two general steps in veneering a 

25 murine antigen-binding site. Initially, the FRs of the variable domains of an antibody 
molecule of interest are compared with corresponding FR sequences of human variable 
domains obtained from the above-identified sources. The most homologous human V 
regions are then compared residue by residue to corresponding murine amino acids. The 
residues in the murine FR which differ from the human counterpart are replaced by the 

30 residues present in the human moiety using recombinant techniques well known in the 
art. Residue switching is only carried out with moieties which are at least partially 
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exposed (solvent accessible), and care is exercised in the replacement of amino acid 
residues which may have a significant effect on the tertiary structure of V region 
domains, such as proline, glycine and charged amino acids. 

In this manner, the resultant "veneered" murine antigen-binding sites are 
5 thus designed to retain the murine CDR residues, the residues substantially adjacent to 
the CDRs, the residues identified as buried or mostly buried (solvent inaccessible), the 
residues believed to participate in non-covalent (e.g., electrostatic and hydrophobic) 
contacts between heavy and light chain domains, and the residues from conserved 
structural regions of the FRs which are believed to influence the "canonical" tertiary 
10 structures of the CDR loops. These design criteria are then used to prepare recombinant 
nucleotide sequences which combine the CDRs of both the heavy and light chain of a 
murine antigen-binding site into human-appearing FRs that can be used to transfect 
mammalian cells for the expression of recombinant human antibodies which exhibit the 
antigen specificity of the murine antibody molecule. 

15 In another embodiment of the invention, monoclonal antibodies of the 

present invention may be coupled to one or more therapeutic agents. Suitable agents in 
this regard include radionuclides, differentiation inducers, drugs, toxins, and derivatives 
thereof. Preferred radionuclides include 90 Y, 123 I, ll \ m l, l86 Re, 188 Re, 2U At, and 
Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 

20 Preferred differentiation inducers include phorbol esters and butyric acid. Preferred 
toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas 
exotoxin, Shigella toxin, and pokeweed antiviral protein. 

A therapeutic agent may be coupled (e.g., covalently bonded) to a 
suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A 

25 direct reaction between an agent and an antibody is possible when each possesses a 
substituent capable of reacting with the other. For example, a nucleophilic group, such 
as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl- 
containing group, such as an anhydride or an acid halide, or with an alkyl group 
containing a good leaving group (e.g., a halide) on the other. 

30 Alternatively, it may be desirable to couple a therapeutic agent and an 

antibody via a linker group. A linker group can function as a spacer to distance an 
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antibody from an agent in order to avoid interference with binding capabilities. A linker 
group can also serve to increase the chemical reactivity of a substituent on an agent or 
an antibody, and thus increase the coupling efficiency. An increase in chemical 
reactivity may also facilitate the use of agents, or functional groups on agents, which 
5 otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of Afunctional 
or polyfunctional reagents, both homo- and hetero-functional (such as those described in 
the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker 
group. Coupling may be effected, for example, through amino groups, carboxyl groups, 

10 sulfhydryl groups or oxidized carbohydrate residues. There are numerous references 
describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al. 

Where a therapeutic agent is more potent when free from the antibody 
portion of the immunoconjugates of the present invention, it may be desirable to use a 
linker group which is cleavable during or upon internalization into a cell. A number of 

15 different cleavable linker groups have been described. The mechanisms for the 
intracellular release of an agent from these linker groups include cleavage by reduction 
of a disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a 
photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter etaL), by hydrolysis of 
derivatized amino acid side chains (e.g 9 U.S. Patent No. 4,638,045, to Kohn et al.), by 

20 serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell 
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.). 

It may be desirable to couple more than one agent to an antibody. In one 
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 

25 Regardless of the particular embodiment, immunoconjugates with more than one agent 
may be prepared in a variety of ways. For example, more than one agent may be 
coupled directly to an antibody molecule, or linkers that provide multiple sites for 
attachment can be used. Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent 

30 bonding either directly or via a linker group. Suitable carriers include proteins such as 
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides 
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such as aminodextran (e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may 
also bear an agent by noncovalent bonding or by encapsulation, such as within a 
liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for 
radionuclide agents include radiohalogenated small molecules and chelating 
5 compounds. For example, U.S. Patent No. 4,735,792 discloses representative 
radiohalogenated small molecules and their synthesis. A radionuclide chelate may be 
formed from chelating compounds that include those containing nitrogen and sulfur 
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For 
example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating 
1 0 compounds and their synthesis. 

T Cell Compositions 

The present invention, in another aspect, provides T cells specific for a 
tumor polypeptide disclosed herein, or for a variant or derivative thereof. Such cells 
may generally be prepared in vitro or ex vivo, using standard procedures. For example, 

15 T cells may be isolated from bone marrow, peripheral blood, or a fraction of bone 
marrow or peripheral blood of a patient, using a commercially available cell separation 
system, such as the Isolex™ System, available from Nexell Therapeutics, Inc. (Irvine, 
CA; see also U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 
91/16116 and WO 92/07243). Alternatively, T cells may be derived from related or 

20 unrelated humans, non-human mammals, cell lines or cultures. 

T cells may be stimulated with a polypeptide, polynucleotide encoding a 
polypeptide and/or an antigen presenting cell (APC) that expresses such a polypeptide. 
Such stimulation is performed under conditions and for a time sufficient to permit the 
generation of T cells that are specific for the polypeptide of interest. Preferably, a tumor 

25 polypeptide or polynucleotide of the invention is present within a delivery vehicle, such 
as a microsphere, to facilitate the generation of specific T cells. 

T cells are considered to be specific for a polypeptide of the present 
invention if the T cells specifically proliferate, secrete cytokines or kill target cells 
coated with the polypeptide or expressing a gene encoding the polypeptide. T cell 

30 specificity may be evaluated using any of a variety of standard techniques. For 
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example, within a chromium release assay or proliferation assay, a stimulation index of 
more than two fold increase in lysis and/or proliferation, compared to negative controls, 
indicates T cell specificity. Such assays may be performed, for example, as described in 
Chen et al., Cancer Res. 54:1065-1070, 1994. Alternatively, detection of the 
5 proliferation of T cells may be accomplished by a variety of known techniques. For 
example, T cell proliferation can be detected by measuring an increased rate of DNA 
synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and 
measuring the amount of tritiated thymidine incorporated into DNA). Contact with a 
tumor polypeptide (100 ng/ml - 100 fig/ml, preferably 200 ng/ml - 25 ng/ml) for 3 - 7 

10 days will typically result in at least a two fold increase in proliferation of the T cells. 
Contact as described above for 2-3 hours should result in activation of the T cells, as 
measured using standard cytokine assays in which a two fold increase in the level of 
cytokine release (e.g., TNF or IFN-y) is indicative of T cell activation (see Coligan et 
al., Current Protocols in Immunology, vol. 1, Wiley Interscience (Greene 1998)). T 

15 cells that have been activated in response to a tumor polypeptide, polynucleotide or 
polypeptide-expressing APC may be CD4 + and/or CD8 + . Tumor polypeptide-specific T 
cells may be expanded using standard techniques. Within preferred embodiments, the T 
cells are derived from a patient, a related donor or an unrelated donor, and are 
administered to the patient following stimulation and expansion. 

20 For therapeutic purposes, CD4 + or CD8 + T cells that proliferate in 

response to a tumor polypeptide, polynucleotide or APC can be expanded in number 
either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 
variety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or a 
short peptide corresponding to an immunogenic portion of such a polypeptide, with or 

25 without the addition of T cell growth factors, such as interleukin-2, and/or stimulator 
cells that synthesize a tumor polypeptide. Alternatively, one or more T cells that 
proliferate in the presence of the tumor polypeptide can be expanded in number by 
cloning. Methods for cloning cells are well known in the art, and include limiting 
dilution. 
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T Cell Receptor Compositions 

The T cell receptor (TCR) consists of 2 different, highly variable 
polypeptide chains, termed the T-cell receptor a and p chains, that are linked by a 
disulfide bond (Janeway, Travers, Walport Immunobiology, Fourth Ed., 148-159. 
5 Elsevier Science Ltd/Garland Publishing. 1999). The <x/p heterodimer complexes with 
the invariant CD3 chains at the cell membrane. This complex recognizes specific 
antigenic peptides bound to MHC molecules. The enormous diversity of TCR 
specificities is generated much like immunoglobulin diversity, through somatic gene 
rearrangement The p chain genes contain over 50 variable (V), 2 diversity (D), over 10 

10 joining (J) segments, and 2 constant region segments (C). The a chain genes contain 
over 70 V segments, and over 60 J segments but no D segments, as well as one C 
segment. During T cell development in the thymus, the D to J gene rearrangement of 
the P chain occurs, followed by the V gene segment rearrangement to the DJ. This 
functional VDJp exon is transcribed and spliced to join to a Cp. For the a chain, a V a 

15 gene segment rearranges to a J a gene segment to create the functional exon that is then 
transcribed and spliced to the C a . Diversity is further increased during the 
recombination process by the random addition of P and N-nucleotides between the V, 
D, and J segments of the P chain and between the V and J segments in the a chain 
(Janeway, Travers, Walport. Immunobiology. Fourth Ed., 98 and 150. Elsevier Science 

20 Ltd/Garland Publishing. 1999). 

The present invention, in another aspect, provides TCRs specific for a 
polypeptide disclosed herein, or for a variant or derivative thereof. In accordance with 
the present invention, polynucleotide and amino acid sequences are provided for the V-J 
or V-D-J junctional regions or parts thereof for the alpha and beta chains of the T-cell 

25 receptor which recognize tumor polypeptides described herein. In general, this aspect 
of the invention relates to T-cell receptors which recognize or bind tumor polypeptides 
presented in the context of MHC. In a preferred embodiment the tumor antigens 
recognized by the T-cell receptors comprise a polypeptide of the present invention. For 
example, cDNA encoding a TCR specific for a Jumor peptide can be isolated from T 

30 cells specific for a tumor polypeptide using standard molecular biological and 
recombinant DNA techniques. 
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This invention further includes the T-cell receptors or analogs thereof 
having substantially the same function or activity as the T-cell receptors of this 
invention which recognize or bind tumor polypeptides. Such receptors include, but are 
not limited to, a fragment of the receptor, or a substitution, addition or deletion mutant 
5 of a T-cell receptor provided herein. This invention also encompasses polypeptides or 
peptides that are substantially homologous to the T-cell receptors provided herein or 
that retain substantially the same activity. The term "analog" includes any protein or 
polypeptide having an amino acid residue sequence substantially identical to the T-cell 
receptors provided herein in which one or more residues, preferably no more than 5 

10 residues, more preferably no more than 25 residues have been conservatively substituted 
with a functionally similar residue and which displays the functional aspects of the T- 
cell receptor as described herein. 

The present invention further provides for suitable mammalian host 
cells, for example, non-specific T cells, that are transfected with a polynucleotide 

15 encoding TCRs specific for a polypeptide described herein, thereby rendering the host 
cell specific for the polypeptide. The a and p chains of the TCR may be contained on 
separate expression vectors or alternatively, on a single expression vector that also 
contains an internal ribosome entry site (IRES) for cap-independent translation of the 
gene downstream of the IRES. Said host cells expressing TCRs specific for -the 

20 polypeptide may be used, for example, for adoptive immunotherapy of lung cancer as 
discussed further below. 

In further aspects of the present invention, cloned TCRs specific for a 
polypeptide recited herein may be used in a kit for the diagnosis of lung cancer. For 
example, the nucleic acid sequence or portions thereof, of tumor-specific TCRs can be 
25 used as probes or primers for the detection of expression of the rearranged genes 
encoding the specific TCR in a biological sample. Therefore, the present invention 
further provides for an assay for detecting messenger RNA or DNA encoding the TCR 
specific for a polypeptide. 



78 



WO 02/47534 



PCTYUS01/47576 



Pharmaceutical Compositions 

In additional embodiments, the present invention concerns formulation 
of one or more of the polynucleotide, polypeptide, T-cell and/or antibody compositions 
disclosed herein in pharmaceutically-acceptable carriers for administration to a cell or 
5 an animal, either alone, or in combination with one or more other modalities of therapy. 

It will be understood that, if desired, a composition as disclosed herein 
may be administered in combination with other agents as well, such as, e.g., other 
proteins or polypeptides or various pharmaceutically-active agents. In fact, there is 
virtually no limit to other components that may also be included, given that the 

10 additional agents do not cause a significant adverse effect upon contact with the target 
cells or host tissues. The compositions may thus be delivered along with various other 
agents as required in the particular instance. Such compositions may be purified from 
host cells or other biological sources, or alternatively may be chemically synthesized as 
described herein. Likewise, such compositions may further comprise substituted or 

1 5 derivatized RNA or DNA compositions. 

Therefore, in another aspect of the present invention, pharmaceutical 
compositions are provided comprising one or more of the polynucleotide, polypeptide, 
antibody, and/or T-cell compositions described herein in combination with a 
physiologically acceptable carrier. In certain preferred embodiments, the 

20 pharmaceutical compositions of the invention comprise immunogenic polynucleotide 
and/or polypeptide compositions of the invention for use in prophylactic and theraputic 
vaccine applications. Vaccine preparation is generally described in, for example, M.F. 
Powell and M J. Newman, eds., "Vaccine Design (the subunit and adjuvant approach)," 
Plenum Press (NY, 1995). Generally, such compositions will comprise one or more 

25 polynucleotide and/or polypeptide compositions of the present invention in combination 
with one or more immunostimulants. 

It will be apparent that any of the pharmaceutical compositions described 
herein can contain pharmaceutically acceptable salts of the polynucleotides and 
polypeptides of the invention. Such salts can be prepared, for example, from 

30 pharmaceutically acceptable non-toxic bases, including organic bases (e.g., salts of 



79 



WO 02/47534 PCI7US01/47576 

primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e.g., 
sodium, potassium, lithium, ammonium, calcium and magnesium salts). 

In another embodiment, illustrative immunogenic compositions, e.g., 
vaccine compositions, of the present invention comprise DNA encoding one or more of 
5 the polypeptides as described above, such that the polypeptide is generated in situ. As 
noted above, the polynucleotide may be administered within any of a variety of delivery 
systems known to those of ordinary skill in the art. Indeed, numerous gene delivery 
techniques are well known in the art, such as those described by Rolland, Crit. Rev. 
Therap. Drug Carrier Systems 75:143-198, 1998, and references cited therein. 

10 Appropriate polynucleotide expression systems will, of course, contain the necessary 
regulatory DNA regulatory sequences for expression in a patient (such as a suitable 
promoter and terminating signal). Alternatively, bacterial delivery systems may involve 
the administration of a bacterium (such as Bacillus-Calmette-Guerriri) that expresses an 
immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. 

15 Therefore, in certain embodiments, polynucleotides encoding 

immunogenic polypeptides described herein are introduced into suitable mammalian 
host cells for expression using any of a number of known viral-based systems. In one 
illustrative embodiment, retroviruses provide a convenient and effective platform for 
gene delivery systems. A selected nucleotide sequence encoding a polypeptide of the 

20 present invention can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered 
to a subject. A number of illustrative retroviral systems have been described (e.g., U.S. 
Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980-990; Miller, A. D. 
(1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Bums 

25 et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
(1993) Cur. Opin. Genet Develop. 3:102-109. 

In addition, a number of illustrative adenovirus-based systems have also 
been described. Unlike retroviruses which integrate into the host genome, adenoviruses 
persist extrachromosomally thus minimizing the risks associated with insertional 

30 mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J. 
Virol. 67:5911-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et 
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al. (1994) J. Virol. 68:933-940; Barr et al. (1994) Gene Therapy 1:51-58; Berkner, K. L. 
(1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 
476). 

Various adeno-associated virus (AAV) vector systems have also been 
5 developed for polynucleotide delivery. AAV vectors can be readily constructed using 
techniques well known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941; 
International Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al. 
(1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring 
Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533- 

10 539; Muzyczka, N. (1992) Current Topics in Microbiol, and Immunol. 158:97-129; 
Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene 
Therapy 1:165-169; and Zhou etal. (1994) J. Exp. Med. 179:1867-1875. 

Additional viral vectors useful for delivering the polynucleotides 
encoding polypeptides of the present invention by gene transfer include those derived 

15 from the pox family of viruses, such as vaccinia virus and avian poxvirus. By way of 
example, vaccinia virus recombinants expressing the novel molecules can be 
constructed as follows. The DNA encoding a polypeptide is first inserted into an 
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is 

20 then used to transfect cells which are simultaneously infected with vaccinia. 
Homologous recombination serves to insert the vaccinia promoter plus the gene 
encoding the polypeptide of interest into the viral genome. The resulting TK.sup.(-) 
recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

25 A vaccinia-based infection/transfection system can be conveniently used 

to provide for inducible, transient expression or coexpression of one or more 
polypeptides described herein in host cells of an organism. In this particular system, 
cells are first infected in vitro with a vaccinia virus recombinant that encodes the 
bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in 

30 that it only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with the polynucleotide or polynucleotides of interest, driven by a T7 



81 



WO 02/47534 



PCT/US01/47576 



promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA which is then translated into 
polypeptide by the host translational machinery. The method provides for high level, 
transient, cytoplasmic production of large quantities of RNA and its translation 
5 products. See, e.g., Elroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743- 
6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, 
can also be used to deliver the coding sequences of interest. Recombinant avipox 
viruses, expressing immunogens from mammalian pathogens, are known to confer 

10 protective immunity when administered to non-avian species. The use of an Avipox 
vector is particularly desirable in human and other mammalian species since members 
of the Avipox genus can only productively replicate in susceptible avian species and 
therefore are not infective in mammalian cells. Methods for producing recombinant 
Avipoxviruses are known in the art and employ genetic recombination, as described 

15 above with respect to the production of vaccinia viruses. See, e.g., WO 91/12882; WO 
89/03429; and WO 92/03545. 

Any of a number of alphavirus vectors can also be used for delivery of 
polynucleotide compositions of the present invention, such as those vectors described in 
U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based 

20 on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 

Moreover, molecular conjugate vectors, such as the adenovirus chimeric 
vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et 
al. Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery 

25 under the invention. 

Additional illustrative information on these and other known viral-based 
delivery systems can be found, for example, in Fisher-Hoch et al., Proc. Natl. Acad. Sci. 
USA 55:317-321, 1989; Flexner et al, Ann. N.Y. Acad. Sci. 5tfP:86-103, 1989; Flexner 
et al., Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487; 

30 WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; 
WO 91/02805; Berkner, Biotechniques 5:616-627, 1988; Rosenfeld et al., Science 
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252:431-434, 1991; Kolls et al., Proc. Natl Acad Set USA 97:215-219, 1994; 
Kass-Eisler et al., Proc. Natl Acad Sci. USA 90:11498-11502, 1993; Guzman et al., 
Circulation 55:2838-2848, 1993; and Guzman et aL, Cir. Res. 73:1202-1207, 1993. 

In certain embodiments, a polynucleotide may be integrated into the 
5 genome of a target cell. This integration may be in the specific location and orientation 
via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet further embodiments, the 
polynucleotide may be stably maintained in the cell as a separate, episomal segment of 
DNA. Such polynucleotide segments or "episomes" encode sequences sufficient to 

10 permit maintenance and replication independent of or in synchronization with the host 
cell cycle. The maimer in which the expression construct is delivered to a cell and 
where in the cell the polynucleotide remains is dependent on the type of expression 
construct employed. 

In another embodiment of the invention, a polynucleotide is 

15 administered/delivered as "naked" DNA, for example as described in Ulmer et al., 
Science 259:1745-1749, 1993 and reviewed by Cohen, Science 259:1691-1692, 1993. 
The uptake of naked DNA may be increased by coating the DNA onto biodegradable 
beads, which are efficiently transported into the cells. 

In still another embodiment, a composition of the present invention can 

20 be delivered via a particle bombardment approach, many of which have been described. 
In one illustrative example, gas-driven particle acceleration can be achieved with 
devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) 
and Powderject Vaccines Inc. (Madison, WI), some examples of which are described in 
U.S. Patent Nos. 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 

25 799. This approach offers a needle-free delivery approach wherein a dry powder 
formulation of microscopic particles, such as polynucleotide or polypeptide particles, 
are accelerated to high speed within a helium gas jet generated by a hand held device, 
propelling the particles into a target tissue of interest. 

In a related embodiment, other devices and methods that may be useful 

30 for gas-driven needle-less injection of compositions of the present invention include 
those provided by Bioject, Inc. (Portland, OR), some examples of which are described 
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in U.S. Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 
and 5,993,412. 

According to another embodiment, the pharmaceutical compositions 
described herein will comprise one or more immunostimulants in addition to the 
5 immunogenic polynucleotide, polypeptide, antibody, T-cell and/or APC compositions 
of this invention. An immunostimulant refers to essentially any substance that enhances 
or potentiates an immune response (antibody and/or cell-mediated) to an exogenous 
antigen. One preferred type of immunostimulant comprises an adjuvant , Many 
adjuvants contain a substance designed to protect the antigen from rapid catabolism, 

10 such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such 
as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. 
Certain adjuvants are commercially available as, for example, Freund's Incomplete 
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 
65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, 

15 PA); aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate; 
salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated 
sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; 
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as 
GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be used as 

20 adjuvants. 

Within certain embodiments of the invention, the adjuvant composition 
is preferably one that induces an immune response predominantly of the Thl type. High 
levels of Thl-type cytokines (e.g., IFN-y, TNFa, IL-2 and EL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 

25 high levels of Th2-type cytokines {e.g., IL-4, DL-5, IL-6 and IL-10) tend to favor the 
induction of humoral immune responses. Following application of a vaccine as 
provided herein, a patient will support an immune response that includes Thl- and Th2- 
type responses. Within a preferred embodiment, in which a response is predominantly 
Thl-type, the level of Thl-type cytokines will increase to a greater extent than the level 

30 of Th2-type cytokines. The levels of these cytokines may be readily assessed using 
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standard assays. For a review of the families of cytokines, see Mosmann and Cofiman, 
Ann. Rev. Immunol. 7:145-173, 1989. 

Certain preferred adjuvants for eliciting a predominantly Thl-type 
response include, for example, a combination of monophosphoryl lipid A, preferably 3- 
5 de-O-acylated monophosphoryl lipid A, together with an aluminum salt MPL® 
adjuvants are available from Corixa Corporation (Seattle, WA; see, for example, US 
Patent Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing 
oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a 
predominantly Thl response. Such oligonucleotides are well known and are described, 

10 for example, in WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 
5,856,462. Immunostimulatory DNA sequences are also described, for example, by 
Sato et aL, Science 273:352, 1996. Another preferred adjuvant comprises a saponin, 
such as Quil A, or derivatives thereof, including QS21 and QS7 (Aquila 
Biopharmaceuticals Inc., Framingham, MA); Escin; Digitonin; or Gypsophila or 

15 Chenopodium quinoa saponins . Other preferred formulations include more than one 
saponin in the adjuvant combinations of the present invention, for example 
combinations of at least two of the following group comprising QS21, QS7, Quil A, p- 
escin, or digitonin. 

Alternatively the saponin formulations may be combined with vaccine 

20 vehicles composed of chitosan or other polycationic polymers, polylactide and 
polylactide-co-glycolide particles, poly-N-acetyl glucosamine-based polymer matrix, 
particles composed of polysaccharides or chemically modified polysaccharides, 
liposomes and lipid-based particles, particles composed of glycerol monoesters, etc. The 
saponins may also be formulated in the presence of cholesterol to form particulate 

25 structures such as liposomes or ISCOMs. Furthermore, the saponins may be formulated 
together with a polyoxyethylene ether or ester, in either a non-particulate solution or 
suspension, or in a particulate structure such as a paucilamelar liposome or ISCOM. The 
saponins may also be formulated with excipients such as Carbopol R to increase 
viscosity, or may be formulated in a dry powder form with a powder excipient such as 

30 lactose. 
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In one preferred embodiment, the adjuvant system includes the 
combination of a monophosphoiyl lipid A and a saponin derivative, such as the 
combination of QS21 and 3D-MPL® adjuvant, as described in WO 94/00153, or a less 
reactogenic composition where the QS21 is quenched with cholesterol, as described in 
5 WO 96/33739. Other preferred formulations comprise an oil-in-water emulsion and 
tocopherol. Another particularly preferred adjuvant formulation employing QS21, 3D- 
MPL® adjuvant and tocopherol in an oil-in-water emulsion is described in WO 
95/17210. 

Another enhanced adjuvant system involves the combination of a CpG- 
10 containing oligonucleotide and a saponin derivative particularly the combination of 
CpG and QS21 is disclosed in WO 00/09159. Preferably the formulation additionally 
comprises an oil in water emulsion and tocopherol. 

Additional illustrative adjuvants for use in the pharmaceutical 
compositions of the invention include Montanide ISA 720 (Seppic, France), SAP 
15 (Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series 
of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, 
Belgium), Detox (Enhanzyn®) (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, 
MT) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described 
in pending U.S. Patent Application Serial Nos. 08/853,826 and 09/074,720, the 
20 disclosures of which are incorporated herein by reference in their entireties, and 
polyoxyethylene ether adjuvants such as those described in WO 99/52549A1. 

Other preferred adjuvants include adjuvant molecules of the general 

formula 

(T): HO(CH 2 CH 2 0) n -A-R, 
25 wherein, n is 1-50, A is a bond or -C(O)-, R is C1-50 alkyl or Phenyl C1-50 alkyl. 

One embodiment of the present invention consists of a vaccine 
formulation comprising a polyoxyethylene ether of general formula (I), wherein n is 
between 1 and 50, preferably 4-24, most preferably 9; the R component is C1.50, 
preferably C4-C20 alkyl and most preferably C12 alkyl, and A is a bond. The 
30 concentration of the polyoxyethylene ethers should be in the range 0.1-20%, preferably 
from 0.1-10%, and most preferably in the range 0.1-1%. Preferred polyoxyethylene 
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ethers are selected from the following group: polyoxyethylene-9-lauryl ether, 
polyoxyethylene-9-steoryl ether, polyoxyethylene-8-steoryl ether, polyoxyethylene-4- 
lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl ether. 
Polyoxyethylene ethers such as polyoxyethylene lauryl ether are described in the Merck 
5 index (12 th edition: entry 7717). These adjuvant molecules are described in WO 
99/52549. 

The polyoxyethylene ether according to the general formula (I) above 
may, if desired, be combined with another adjuvant For example, a preferred adjuvant 
combination is preferably with CpG as described in the pending UK patent application 

10 GB 9820956.2. 

According to . another embodiment of this invention, an immunogenic 
composition described herein is delivered to a host via antigen presenting cells (APCs), 
such as dendritic cells, macrophages, B cells, monocytes and other cells that may be 
engineered to be efficient APCs. Such cells may, but need not, be genetically modified 

15 to increase the capacity for presenting the antigen, to improve activation and/or 
maintenance of the T cell response, to have anti-tumor effects per se and/or to be 
immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs 
may generally be isolated from any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic 

20 or xenogeneic cells. 

Certain preferred embodiments of the present invention use dendritic 
cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent 
APCs (Banchereau and Steinman, Nature 5P2:245-251, 1998) and have been shown to 
be effective as a physiological adjuvant for eliciting prophylactic or therapeutic 

25 antitumor immunity (see Timmerman and Levy, Ann, Rev. Med 50:507-529, 1999). In 
general, dendritic cells may be identified based on their typical shape (stellate in situ, 
with marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up, 
process and present antigens with high efficiency and their ability to activate naive T 
cell responses. Dendritic cells may, of course, be engineered to express specific cell- 

30 surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex 
vivo, and such modified dendritic cells are contemplated by the present invention. As 
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an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be used within a vaccine (see Zitvogel et aL, Nature Med 4:594-600, 
1998). 

Dendritic cells and progenitors may be obtained from peripheral blood, 
5 bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph 
nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For 
example, dendritic cells may be differentiated ex vivo by adding a combination of 
cytokines such as GM-CSF, IL-4, EL-13 and/or TNFa to cultures of monocytes 
harvested from peripheral blood. Alternatively, CD34 positive cells harvested from 

10 peripheral blood, umbilical cord blood or bone marrow may be differentiated into 
dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNFa, 
CD40 ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, 
maturation and proliferation of dendritic cells. 

Dendritic cells are conveniently categorized as "immature" and "mature" 

15 cells, which allows a simple way to discriminate between two well characterized 
phenotypes. However, this nomenclature should not be construed to exclude all 
possible intermediate stages of differentiation. Immature dendritic cells are 
characterized as APC with a high capacity for antigen uptake and processing, which 
' correlates with the high expression of Fey receptor and mannose receptor. The mature 

20 phenotype is typically characterized by a lower expression of these markers, but a high 
expression of cell surface molecules responsible for T cell activation such as class I and 
class II MHC, adhesion molecules (e.g., CD54 and GDI 1) and costimulatory molecules 
(e.g., CD40, CD80, CD86 and4-lBB). 

APCs may generally be transfected with a polynucleotide of the 

25 invention (or portion or other variant thereof) such that the encoded polypeptide, or an 
immunogenic portion thereof, is expressed on the cell surface. Such transfection may 
take place ex vivo, and a pharmaceutical composition comprising such transfected cells 
may then be used for therapeutic purposes, as described herein. Alternatively, a gene 
delivery vehicle that targets a dendritic or other antigen presenting cell may be 

30 administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex 
vivo transfection of dendritic cells, for example, may generally be performed using any 
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methods known in the art, such as those described in WO 97/24447, or the gene gun 
approach described by Mahvi et al., Immunology and cell Biology 75:456-460, 1997. 
Antigen loading of dendritic cells may be achieved by incubating dendritic cells or 
progenitor cells with the tumor polypeptide, DNA (naked or within a plasmid vector) or 
5 RNA; or with antigen-expressing recombinant bacterium or viruses (e.g., vaccinia, 
fowlpox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be 
covalently conjugated to an immunological partner that provides T cell help (e.g., a 
carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated 
immunological partner, separately or in the presence of the polypeptide. 

10 While any suitable carrier known to those of ordinary skill in the art may 

be employed in the pharmaceutical compositions of this invention, the type of carrier 
will typically vary depending on the mode of administration. Compositions of the 
present invention may be formulated for any appropriate manner of administration, 
including for example, topical, oral, nasal, mucosal, intravenous, intracranial, 

1 5 intraperitoneal, subcutaneous and intramuscular administration. 

Carriers for use within such pharmaceutical compositions are 
biocompatible, and may also be biodegradable. In certain embodiments, the 
formulation preferably provides a relatively constant level of active component release. 
In other embodiments, however, a more rapid rate of release immediately mpon 

20 administration may be desired. The formulation of such compositions is well within the 
level of ordinary skill in the art using known techniques. Illustrative carriers useful in 
this regard include microparticles of poly(lactide-co-glycolide), polyacrylate, latex, 
starch, cellulose, dextran and the like. Other illustrative delayed-release carriers 
include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g., 

25 a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer 
comprising an amphiphilic compound, such as a phospholipid (see e.g., U.S. Patent No. 
5,151,254 and PCT applications WO 94/20078, WO/94/23701 and WO 96/06638). The 
amount of active compound contained within a sustained release formulation depends 
upon the site of implantation, the rate and expected duration of release and the nature of 

30 the condition to be treated or prevented. 
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In another illustrative embodiment, biodegradable microspheres (e.g., 
polylactate polyglycolate) are employed as carriers for the compositions of this 
invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 
Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 
5 5,814,344, 5,407,609 and 5,942,252. Modified hepatitis B core protein carrier systems, 
such as described in WO/99 40934, and references cited therein, will also be useful for 
many applications. Another illustrative carrier/delivery system employs a carrier 
comprising particulate-protein complexes, such as those described in U.S. Patent No. 
5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte 

1 0 responses in a host 

The pharmaceutical compositions of the invention will often further 
comprise one or more buffers (e.g., neutral buffered saline or phosphate buffered 
saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, 
polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating 

15 agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that 
render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a 
recipient, suspending agents, thickening agents and/or preservatives. Alternatively, 
compositions of the present invention may be formulated as a lyophilizate. 

The pharmaceutical compositions described herein may be presented in 

20 unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers 
are typically sealed in such a way to preserve the sterility and stability of the 
formulation until use. In general, formulations may be stored as suspensions, solutions 
or emulsions in oily or aqueous vehicles. Alternatively, a pharmaceutical composition 
may be stored in a freeze-dried condition requiring only the addition of a sterile liquid 

25 carrier immediately prior to use. 

The development of suitable dosing and treatment regimens for using the 
particular compositions described herein in a variety of treatment regimens, including 
e.g., oral, parenteral, intravenous, intranasal, and intramuscular administration and 
formulation, is well known in the art, some of which are briefly discussed below for 

30 general purposes of illustration. 
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In certain applications, the pharmaceutical compositions disclosed herein 
may be delivered via oral administration to an animal. As such, these compositions 
may be formulated with an inert diluent or with an assimilable edible carrier, or they 
may be enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into 
5 tablets, or they may be incorporated directly with the food of the diet * 

The active compounds may even be incorporated with excipients and 
used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, 
suspensions, syrups, wafers, and the like (see, for example, Mathiowitz et aL, Nature 
1997 Mar 27;386(6623):410-4; Hwang et ah, Crit Rev Ther Drug Carrier Syst 

10 1998;15(3):243-84; U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 
5,792,451). Tablets, troches, pills, capsules and the like may also contain any of a 
variety of additional components, for example, a binder, such as gum tragacanth, acacia, 
cornstarch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating agent, 
such as corn starch, potato starch, alginic acid and the like; a lubricant, such as 

15 magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may 
be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 
materials of the above type, a liquid carrier. Various other materials may be present as 
coatings or to otherwise modify the physical form of the dosage unit For instance, 

20 tablets, pills, or capsules may be coated with shellac, sugar, or both. Of course, any 
material used in preparing any dosage unit form should be pharmaceutically pure and 
substantially non-toxic in the amounts employed. In addition, the active compounds 
may be incorporated into sustained-release preparation and formulations. 

Typically, these formulations will contain at least about 0.1% of the 

25 active compound or more, although the percentage of the active ingredient(s) may, of 
course, be varied and may conveniently be between about 1 or 2% and about 60% or 
70% or more of the weight or volume of the total formulation. Naturally, the amount of 
active compound(s) in each therapeutically useful composition may be prepared is such 
a way that a suitable dosage will be obtained in any given unit dose of the compound. 

30 Factors such as solubility, bioavailability, biological half-life, route of administration, 
product shelf life, as well as other pharmacological considerations will be contemplated 

i 
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by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 
variety of dosages and treatment regimens may be desirable. 

For oral administration the compositions of the present invention may 
alternatively be incorporated with one or more excipients in the form of a mouthwash, 
5 dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation. 
Alternatively, the active ingredient may be incorporated into an oral solution such as 
one containing sodium borate, glycerin and potassium bicarbonate, or dispersed in a 
dentifrice, or added in a therapeutically-effective amount to a composition that may 
include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 

10 Alternatively the compositions may be fashioned into a tablet or solution form Uiat may 
be placed under the tongue or otherwise dissolved in the mouth. 

In certain circumstances it will be desirable to deliver the pharmaceutical 
compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
intraperitoneally. Such approaches are well known to the skilled artisan, some of which 

15 are further described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515 
and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds 
as free base or pharmacologically acceptable salts may be prepared in water suitably 
mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. 

20 Under ordinary conditions of storage and use, these preparations generally will contain a 
preservative to prevent the growth of microorganisms. 

Illustrative pharmaceutical forms suitable for injectable use include 
sterile aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions (for example, see U. S. Patent 

25 5,466,468). In all cases the form must be sterile and must be fluid to the extent that 
easy syringability exists. It must be stable under the conditions of manufacture and 
storage and must be preserved against the contaminating action of microorganisms, 
such as bacteria and fungi. The carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol {e.g., glycerol, propylene glycol, and 

30 liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable 
oils. Proper fluidity may be maintained, for example, by the use of a coating, such as 
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lecithin, by the maintenance of the required particle size in the case of dispersion and/or 
by the use of surfactants. The prevention of the action of microorganisms can be 
facilitated by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 
5 preferable to include isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for example, aluminum monostearate 
and gelatin. 

In one embodiment, for parenteral administration in an aqueous solution, 

10 the solution should be suitably buffered if necessary and the liquid diluent first rendered 
isotonic with sufficient saline or glucose. These particular aqueous solutions are 
especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, a sterile aqueous medium that can be employed will 
be known to those of skill in the art in light of the present disclosure. For example, one 

15 dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml 
of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570- 
1580). Some variation in dosage will necessarily occur depending on the condition of 
the subject being treated. Moreover, for human administration, preparations will of 

20 course preferably meet sterility, pyrogenicity, and the general safety and purity 
standards as required by FDA Office of Biologies standards. 

In another embodiment of the invention, the compositions disclosed 
herein may be formulated in a neutral or salt form: Illustrative 
pharmaceutically-acceptable salts include the acid addition salts (formed with the free 

25 amino groups of the protein) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 
calcium, or ferric hydroxides, and such organic bases as isopropylamine, 

30 trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be 
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administered in a manner compatible with the dosage formulation and in such amount 
as is therapeutically effective. 

The carriers can further comprise any and all solvents, dispersion media, 
vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption 
5 delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use 
of such media and agents for pharmaceutical active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible with the active 
ingredient, its use in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. The phrase 

10 "pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar untoward reaction when administered to a human. 

In certain embodiments, the pharmaceutical compositions may be 
delivered by intranasal sprays, inhalation, and/or other aerosol delivery vehicles. 
Methods for delivering genes, nucleic acids, and peptide compositions directly to the 

15 lungs via nasal aerosol sprays has been described, e.g., in U. S. Patent 5,756,353 and U. 
S. Patent 5,804,212. Likewise, the delivery of drugs using intranasal microparticle 
resins (Takenaga et al 9 J Controlled' Release 1998 Mar 2;52(l-2):81-7) ■ and 
lysophosphatidyl-glycerol compounds (U. S. Patent 5,725,871) are also well-known in 
the pharmaceutical arts. Likewise, illustrative transmucosal drug delivery in the form of 

20 a polytetrafluoroetheylene support matrix is described in U. S. Patent 5,780,045. 

In certain embodiments, liposomes, nanocapsules, microparticles, lipid 
particles, vesicles, and the like, are used for the introduction of the compositions of the 
present invention into suitable host cells/organisms. In particular, the compositions of 
the present invention may be formulated for delivery either encapsulated in a lipid 

25 particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like. Alternatively, 
compositions of the present invention can be bound, either covalently or non-covalently, 
to the surface of such carrier vehicles. 

The formation and use of liposome and liposome-like preparations as 
potential drug carriers is generally known to those of skill in the art (see for example, 

30 Lasic, Trends Biotechnol 1998 Jul;16(7):307-21; Takakura, Nippon Rinsho 1998 
Mar;56(3):691-5; Chandran et a/., Indian J Exp Biol. 1997 Aug;35(8):801-9; Margalit, 
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Crit Rev Ther Drug Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. 
Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 
5,795,587, each specifically incorporated herein by reference in its entirety). 

Liposomes have been used successfully with a number of cell types that 
5 are normally difficult to transfect by other procedures, including T cell suspensions, 
primary hepatocyte cultures and PC 12 cells (Renneisen et al, J Biol Chem. 1990 Sep 
25;265(27): 16337-42; Muller et aL, DNA Cell Biol. 1990 Apr;9(3):221-9). In addition, 
liposomes are free of the DNA length constraints that are typical of viral-based delivery 
systems. Liposomes have been used effectively to introduce genes, various drugs, 
10 radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, into a variety of cultured cell lines and animals. Furthermore, he use of 
liposomes does not appear to be associated with autoimmune responses or unacceptable 
toxicity after systemic delivery. 

In certain embodiments, liposomes are formed from phospholipids that 
15 are dispersed in an aqueous medium and spontaneously form multilamellar concentric 
bilayer vesicles (also termed multilamellar vesicles (MLVs). 

Alternatively, in other embodiments, the invention provides for 
pharmaceutically-acceptable nanocapsule formulations of the compositions of the 
present invention. Nanocapsules can generally entrap compounds in a stable and 
20 reproducible way (see, for example, Quintanar-Guerrero et al, Drug Dev Ind Pharm. 
1998 Dec;24(12):l 113-28). To avoid side effects due to intracellular polymeric 
overloading, such ultrafine particles (sized around 0.1 \wx) may be designed using 
polymers able to be degraded in vivo. Such particles can be made as described, for 
example, by Couvreur etal, Crit Rev Ther Drug Carrier Syst. 1988;5(l):l-20; zur 
25 Muhlen et al, Eur J Pharm Biophann. 1998 Mar;45(2):149-55; Zambaux et al J 
Controlled Release. 1998 Jan2;50(l-3):31-40; andU. S. Patent 5,145,684. 

Cancer Therapeutic Methods 

Immunologic approaches to cancer therapy are based on the recognition 
that cancer cells can often evade the body's defenses against aberrant or foreign cells 
30 and molecules, and that these defenses might be therapeutically stimulated to regain the 
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lost ground, e.g. pgs. 623-648 in Klein, Immunology (Wiley-Interscience, New York, 
1982). Numerous recent observations that various immune effectors can directly or 
indirectly inhibit growth of tumors has led to renewed interest in this approach to cancer 
therapy, e.g. Jager, et aL, Oncology 2001 ;60(1): 1-7; Renner, et aL, Ann Hematol 2000 
5 Dec;79(12):651-9. 

Four-basic cell types whose function has been associated with antitumor 
cell immunity and the elimination of tumor cells from the body are: i) B-lymphocytes 
which secrete immunoglobulins into the blood plasma for identifying and labeling the 
nonself invader cells; ii) monocytes which secrete the complement proteins that are 

10 responsible for lysing and processing the immunoglobulin-coated target invader cells; 
iii) natural killer lymphocytes having two mechanisms for the destruction of tumor 
cells, antibody-dependent cellular cytotoxicity and natural killing; and iv) T- 
lymphocytes possessing antigen-specific receptors and having the capacity to recognize 
a tumor cell carrying complementary marker molecules (Schreiber, H., 1989, in 

15 Fundamental Immunology (ed). W. E. Paul, pp. 923-955). 

Cancer immunotherapy generally focuses on inducing humoral immune 
responses, cellular immune responses, or both. Moreover, it is well established that 
induction of CD4 + T helper cells is necessary in order to secondarily induce either 
antibodies or cytotoxic CD8 + T cells. Polypeptide antigens that are selective or ideally 

20 specific for cancer cells, particularly lung cancer cells, offer a powerful approach for 
inducing immune responses against lung cancer, and are an important aspect of the 
present invention. 

Therefore, in further aspects of the present invention, the pharmaceutical 
compositions described herein may be used for the treatment of cancer, particularly for 

25 the immunotherapy of lung cancer. Within such methods, the pharmaceutical 
compositions described herein are administered to a patient, typically a warm-blooded 
animal, preferably a human. A patient may or may not be afflicted with cancer. 
Accordingly, the above pharmaceutical compositions may be used to prevent the 
development of a cancer or to treat a patient afflicted with a cancer. Pharmaceutical 

30 compositions and vaccines may be administered either prior to or following surgical 
removal of primary tumors and/or treatment such as administration of radiotherapy or 
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conventional chemotherapeutic drugs. As discussed above, administration of the 
pharmaceutical compositions may be by any suitable method, including administration 
by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal, 
anal, vaginal, topical and oral routes. 
5 Within certain embodiments, immunotherapy may be active 

immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous 
host immune system to react against tumors with the administration of immune 
response-modifying agents (such as polypeptides and polynucleotides as provided 
herein). 

10 Within other embodiments, immunotherapy may be passive 

immunotherapy, in which treatment involves the delivery of agents with established 
tumor-immune reactivity (such as effector cells or antibodies) that can directly or 
indirectly mediate antitumor effects and does not necessarily depend on an intact host 
immune system. Examples of effector cells include T cells as discussed above, T 

15 lymphocytes (such as CD8 + cytotoxic T lymphocytes and CD4 + T-helper tumor- 
infiltrating lymphocytes), killer cells (such as Natural Killer cells and lymphokine- 
activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and 
macrophages) expressing a polypeptide provided herein. T cell receptors and antibody 
receptors specific for the polypeptides recited herein may be cloned, expressed and 

20 transferred into other vectors or effector cells for adoptive immunotherapy. The 
polypeptides provided herein may also be used to generate antibodies or anti-idiotypic 
antibodies (as described above and in U.S. Patent No. 4,918,164) for passive 
immunotherapy. 

Monoclonal antibodies may be labeled with any of a variety of labels for 
25 desired selective usages in detection, diagnostic assays or therapeutic applications (as 
described in U.S. Patent Nos. 6,090,365; 6,015,542; 5,843,398; 5,595,721; and 
4,708,930, hereby incorporated by reference in their entirety as if each was incorporated 
individually). In each case, the binding of the labelled monoclonal antibody to the 
determinant site of the antigen will signal detection or delivery of a particular 
30 therapeutic agent to the antigenic determinant on the non-normal cell. A further object 
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of this invention is to provide the specific monoclonal antibody suitably labelled for 
achieving such desired selective usages thereof. 

Effector cells may generally be obtained in sufficient quantities for 
adoptive immunotherapy by growth in vitro, as described herein. Culture conditions for 
5 expanding single antigen-specific effector cells to several billion in number with 
retention of antigen recognition in vivo are well known in the art. Such in vitro culture 
conditions typically use intermittent stimulation with antigen, often in the presence of 
cytokines (such as IL-2) and non-dividing feeder cells. As noted above, 
immunoreactive polypeptides as provided herein may be used to rapidly expand 

10 antigen-specific T cell cultures in order to generate a sufficient number of cells for 
immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, 
monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 
in the art. For example, antigen-presenting cells can be transfected with a 

15 polynucleotide having a promoter appropriate for increasing expression in a 
recombinant virus or other expression system. Cultured effector cells for use in therapy 
must be able to grow and distribute widely, and to survive long term in vivo. Studies 
have shown that cultured effector cells can be induced to grow in vivo and to survive 
long term in substantial numbers by repeated stimulation with antigen supplemented 

20 with IL-2 (see, for example, Cheever et al., Immunological Reviews J 57:177, 1997). 

Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells taken from a patient and clonally propagated ex 
vivo for transplant back into the same patient Transfected cells may be reintroduced 
into the patient using any means known in the art, preferably in sterile form by 

25 intravenous, intracavitary, intraperitoneal or intratumor administration. 

Routes and frequency of administration of the therapeutic compositions 
described herein, as well as dosage, will vary from individual to individual, and may be 
readily established using standard techniques. In general, the pharmaceutical 
compositions and vaccines may be administered by injection (e.g., intracutaneous, 

30 intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. 
Preferably, between 1 and 10 doses may be administered over a 52 week period. 
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Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations 
may be given periodically thereafter. Alternate protocols may be appropriate for 
individual patients. A suitable dose is an amount of a compound that, when 
administered as described above, is capable of promoting an anti-tumor immune 
5 response, and is at least 10-50% above the basal (i.e., untreated) level. Such response 
can be monitored by measuring die anti-tumor antibodies in a patient or by vaccine- 
dependent generation of cytolytic effector cells capable of killing the patient's tumor 
cells in vitro. Such vaccines should also be capable of causing an immune response that 
leads to an improved clinical outcome (e.g., more frequent remissions, complete or 

10 partial or longer disease-free survival) in vaccinated patients as compared to non- 
vaccinated patients. In general, for pharmaceutical compositions and vaccines 
comprising one or more polypeptides, the amount of each polypeptide present in a dose 
ranges from about 25 jxg to 5 mg per kg of host. Suitable dose sizes will vary with the 
size of the patient, but will typically range from about 0. 1 mL to about 5 mL. 

15 In general, an appropriate dosage and treatment regimen provides the 

active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic 
benefit Such a response can be monitored by establishing an improved clinical 
outcome (e.g., more frequent remissions, complete or partial, or longer disease-free 
survival) in treated patients as compared to non-treated patients. Increases in 

20 preexisting immune responses to a tumor protein generally correlate with an improved 
clinical outcome. Such immune responses may generally be evaluated using standard 
proliferation, cytotoxicity or cytokine assays, which may be performed using samples 
obtained from a patient before and after treatment. 

Cancer Detection and Diagnostic Compositions, Methods and Kits 
25 In general, a cancer may be detected in a patient based on the presence of 

one or more lung tumor proteins and/or polynucleotides encoding such proteins in a 
biological sample (for example, blood, sera, sputum urine and/or tumor biopsies) 
obtained from the patient. In other words, such proteins may be used as markers to 
indicate the presence or absence of a cancer such as lung cancer. In addition, such 
30 proteins may be useful for the detection of other cancers. The binding agents provided 
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herein generally permit detection of the level of antigen that binds to the agent in the 
biological sample. 

Polynucleotide primers and probes may be used to detect the level of 
mRNA encoding a tumor protein, which is also indicative of the presence or absence of 
5 a cancer. In general, a tumor sequence should be present at a level that is at least two- 
fold, preferably three-fold, and more preferably five-fold or higher in tumor tissue than 
in normal tissue of the same type from which the tumor arose. Expression levels of a 
particular tumor sequence in tissue types different from that in which the tumor arose 
are irrelevant in certain diagnostic embodiments since the presence of tumor cells can 

10 be confirmed by observation of predetermined differential expression levels, e.g., 2- 
fold, 5-fold, etc, in tumor tissue to expression levels in normal tissue of the same type. 

Other differential expression patterns can be utilized advantageously for 
diagnostic purposes. For example, in one aspect of the invention, overexpression of a 
tumor sequence in tumor tissue and normal tissue of the same type, but not in other 

15 normal tissue types, e.g. PBMCs, can be exploited diagnostically. In this case, the 
presence of metastatic tumor cells, for example in a sample taken from the circulation 
or some other tissue site different from that in which the tumor arose, can be identified 
and/or confirmed by detecting expression of the tumor sequence in the sample, for 
example using RT-PCR analysis. In many instances, it will be desired to enrich for 

20 tumor cells in the sample of interest, e.g., PBMCs, using cell capture or other like 
techniques. 

There are a variety of assay formats known to those of ordinary skill in 
the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., 
Harlow and Lane, Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory, 

25 1988. In general, the presence or absence of a cancer in a patient may be determined by 
(a) contacting a biological sample obtained from a patient with a binding agent; (b) 
detecting in the sample a level of polypeptide that binds to the binding agent; and (c) 
comparing the level of polypeptide with a predetermined cut-off value. 

In a preferred embodiment, the assay involves the use of binding agent 

30 immobilized on a solid support to bind to and remove the polypeptide from the 
remainder of the sample. The bound polypeptide may then be detected using a detection 
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reagent that contains a reporter group and specifically binds to the binding 
agent/polypeptide complex. Such detection reagents may comprise, for example, a 
binding agent that specifically binds to the polypeptide or an antibody or other agent 
that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G, 
5 protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized 
binding agent after incubation of the binding agent with the sample. The extent to 
which components of the sample inhibit the binding of the labeled polypeptide to the 
binding agent is indicative of the reactivity of the sample with the immobilized binding 
10 agent. Suitable polypeptides for use within such assays include full length lung tumor 
proteins and polypeptide portions thereof to which the binding agent binds, as described 
above. 

The solid support may be any material known to those of ordinary skill 
in the art to which the tumor protein may be attached. For example, the solid support 

15 may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support 

20 using a variety of techniques known to those of skill in the art, which are amply 
described in the patent and scientific literature. In the context of the present invention, 
the term "immobilization" refers to both noncovalent association, such as adsorption, 
and covalent attachment (which may be a direct linkage between the agent and 
functional groups on the support or may be a linkage by way of a cross-linking agent). 

25 Immobilization by adsorption to a well in a microtiter plate or to a membrane is 
preferred. In such cases, adsorption may be achieved by contacting the binding agent, in 
a suitable buffer, with the solid support for a suitable amount of time. The contact time 
varies with temperature, but is typically between about 1 hour and about 1 day. In 
general, contacting a well of a plastic microtiter plate (such as polystyrene or 

30 polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 
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10 jig, and preferably about 100 ng to about 1 ng, is sufficient to immobilize an 
adequate amount of binding agent 

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a bifunctional reagent that will react with 
5 both the support and a functional group, such as a hydroxyl or amino group, on the 
binding agent. For example, the binding agent may be covalently attached to supports 
having an appropriate polymer coating using benzoquinone or by condensation of an 
aldehyde group on the support with an amine and an active hydrogen on the binding 
partner {see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at 
10 A12-A13). 

In certain embodiments, the assay is a two-antibody sandwich assay. 
This assay may be performed by first contacting an antibody that has been immobilized 
on a solid support, commonly the well of a microtiter plate, with the sample, such that 
polypeptides within the sample are allowed to bind to the immobilized antibody. 

15 Unbound sample is then removed from the immobilized polypeptide-antibody 
complexes and a detection reagent (preferably a second antibody capable of binding to a 
different site on the polypeptide) containing a reporter group is added. The amount of 
detection reagent that remains bound to the solid support is then determined using a 
method appropriate for the specific reporter group. 

20 More specifically, once the antibody is immobilized on the support as 

described above, the remaining protein binding sites on the support are typically 
blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as 
bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The 
immobilized antibody is then incubated with the sample, and polypeptide is allowed to 

25 bind to the antibody. The sample may be diluted with a suitable diluent, such as 
phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time (i.e. 5 incubation time) is a period of time that is sufficient to detect the presence of 
polypeptide within a sample obtained from an individual with lung cancer. Preferably, 
the contact time is sufficient to achieve a level of binding that is at least about 95% of 

30 that achieved at equilibrium between bound and unbound polypeptide. Those of 
ordinary skill in the art will recognize that the time necessary to achieve equilibrium 
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may be readily determined by assaying the level of binding that occurs over a period of 
time. At room temperature, an incubation time of about 30 minutes is generally 
sufficient. 

Unbound sample may then be removed by washing the solid support 
5 with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second 
antibody, which contains a reporter group, may then be added to the solid support. 
Preferred reporter groups include those groups recited above. 

The detection reagent is then incubated with the immobilized antibody- 
polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 

10 An appropriate amount of time may generally be determined by assaying the level of 
binding that occurs over a period of time. Unbound detection reagent is then removed 
and bound detection reagent is detected using the reporter group. The method employed 
for detecting the reporter group depends upon the nature of the reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally 

15 appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups 
and fluorescent groups. Biotin may be detected using avidin, coupled to a different 
reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
specific period of time), followed by spectroscopic or other analysis of the reaction 

20 products. 

To determine the presence or absence of a cancer, such as lung cancer, 
the signal detected from the reporter group that remains bound to the solid support is 
generally compared to a signal that corresponds to a predetermined cut-off value. In 
one preferred embodiment, the cut-off value for the detection of a cancer is the average 

25 mean signal obtained when the immobilized antibody is incubated with samples from 
patients without the cancer. In general, a sample generating a signal that is three 
standard deviations above the predetermined cut-off value is considered positive for the 
cancer. In an alternate preferred embodiment, the cut-off value is determined using a 
Receiver Operator Curve, according to the method of Sackett et al., Clinical 

30 Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, 
p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot 
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of pairs of true positive rates (z.e., sensitivity) and false positive rates (100%-specificity) 
that correspond to each possible cut-off value for the diagnostic test result. The cut-off 
value on the plot that is the closest to the upper left-hand corner (£e. t the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 
5 signal that is higher than the cut-off value determined by this method may be considered 
positive. Alternatively, the cut-off value may be shifted to the left along the plot, to 
minimize the false positive rate, or to the right, to minimize the false negative rate. In 
general, a sample generating a signal that is higher than the cut-off value determined by 
this method is considered positive for a cancer. 

10 In a related embodiment, the assay is performed in a flow-through or 

strip test format, wherein the binding agent is immobilized on a membrane, such as 
nitrocellulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized binding agent as the sample passes through the membrane. A second, 
labeled binding agent then binds to the binding agent-polypeptide complex as a solution 

15 containing the second binding agent flows through the membrane. The detection of 
bound second binding agent may then be performed as described above. In the strip test 
format, one end of the membrane to which binding agent is bound is immersed in a 
solution containing the sample. The sample migrates along the membrane through a 
region containing second binding agent and to the area of immobilized binding agent 

20 Concentration of second binding agent at the area of immobilized antibody indicates the 
presence of a cancer. Typically, the concentration of second binding agent at that site 
generates a pattern, such as a line, that can be read visually. The absence of such a 
pattern indicates a negative result. In general, the amount of binding agent immobilized 
on the membrane is selected to generate a visually discernible pattern when the 

25 biological sample contains a level of polypeptide that would be sufficient to generate a 
positive signal in the two-antibody sandwich assay, in the format discussed above. 
Preferred binding agents for use in such assays are antibodies and antigen-binding 
fragments thereof. Preferably, the amount of antibody immobilized on the membrane 
ranges from about 25 ng to about lug, and more preferably from about 50 ng to about 

30 500 ng. Such tests can typically be performed with a very small amount of biological 
sample. 
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Of course, numerous other assay protocols exist that are suitable for use 
with the tumor proteins or binding agents of the present invention. The above 
descriptions are intended to be exemplary only. For example, it will be apparent to 
those of ordinary skill in the art that the above protocols may be readily modified to use 
5 tumor polypeptides to detect antibodies that bind to such polypeptides in a biological 
sample. The detection of such tumor protein specific antibodies may correlate with the 
presence of a cancer. 

A cancer may also, or alternatively, be detected based on the presence of 
T cells that specifically react with a tumor protein in a biological sample. Within 

10 certain methods, a biological sample comprising CD4 + and/or CD8 + T cells isolated 
from a patient is incubated with a tumor polypeptide, a polynucleotide encoding such a 
polypeptide and/or an APC that expresses at least an immunogenic portion of such a 
polypeptide, and the presence or absence of specific activation of the T cells is detected. 
Suitable biological samples include, but are not limited to, isolated T cells. For 

15 example, T cells may be isolated from a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T 
cells may be incubated in vitro for 2-9 days (typically 4 days) at 37°C with polypeptide 
(e.g., 5-25 fig/ml). It may be desirable to incubate another aliquot of a T cell sample in 
the absence of tumor polypeptide to serve as a control. For CD4 + T cells, activation is 

20 preferably detected by evaluating proliferation of the T cells. For CD8 + T cells, 
activation is preferably detected by evaluating cytolytic activity. A level of proliferation 
that is at least two fold greater and/or a level of cytolytic activity that is at least 20% 
greater than in disease-free patients indicates the presence of a cancer in the patient 

As noted above, a cancer may also, or alternatively, be detected based on 

25 the level of mRNA encoding a tumor protein in a biological sample. For example, at 
least two oligonucleotide primers may be employed in a polymerase chain reaction 
(PCR) based assay to amplify a portion of a tumor cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for (i.e., 
hybridizes to) a polynucleotide encoding the tumor protein. The amplified cDNA is 

30 then separated and detected using techniques well known in the art, such as gel 
electrophoresis. 
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Similarly, oligonucleotide probes that specifically hybridize to a 
polynucleotide encoding a tumor protein may be used in a hybridization assay to detect 
the presence of polynucleotide encoding the tumor protein in a biological sample. 

To permit hybridization under assay conditions, oligonucleotide primers 
5 and probes should comprise an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at least about 90%, identity to a 
portion of a polynucleotide encoding a tumor protein of the invention that is at least 10 
nucleotides, and preferably at least 20 nucleotides, in length. Preferably, 
oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a 

10 polypeptide described herein under moderately stringent conditions, as defined above. 
Oligonucleotide primers and/or probes which may be usefully employed in the 
diagnostic methods described herein preferably are at least 10-40 nucleotides in length. 
In a preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule 

15 having a sequence as disclosed herein. Techniques for both PGR based assays and 
hybridization assays are well known in the art (see, for example, Mullis et al., Cold 
Spring Harbor Symp. Quant. Biol, 57:263, 1987; Erlich ed., PCR Technology, Stockton 
Press, NY, 1989). 

One preferred assay employs RT-PCR, in which PCR is applied in 
20 conjunction with reverse transcription. Typically, RNA is extracted from a biological 
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. 
PCR amplification using at least one specific primer generates a cDNA molecule, which 
may be separated and visualized using, for example, gel electrophoresis. Amplification 
may be performed on biological samples taken from a test patient and from an 
25 individual who is not afflicted with a cancer. The amplification reaction may be 
performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold 
or greater increase in expression in several dilutions of the test patient sample as 
compared to the same dilutions of the non-cancerous sample is typically considered 
positive. 

30 In another aspect of the present invention, cell capture technologies may 

be used in conjunction, with, for example, real-time PCR to provide a more sensitive 
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tool for detection of metastatic cells expressing lung tumor antigens. Detection of lung 
cancer cells in biological samples, e.g., bone marrow samples, peripheral blood, and 
small needle aspiration samples is desirable for diagnosis and prognosis in lung cancer 
patients. 

5 Immunomagnetic beads coated with specific monoclonal antibodies to 

surface cell markers, or tetrameric antibody complexes, may be used to first enrich or 
positively select cancer cells in a sample. Various commercially available kits may be 
used, including Dynabeads® Epithelial Enrich (Dynal Biotech, Oslo, Norway), 
StemSep™ (StemCell Technologies, Inc., Vancouver, BC), and RosetteSep (StemCell 

10 Technologies). A skilled artisan will recognize that other methodologies and kits may 
also be used to enrich or positively select desired cell populations. Dynabeads® 
Epithelial Enrich contains magnetic beads coated with mAbs specific for two 
glycoprotein membrane antigens expressed on normal and neoplastic epithelial tissues. 
The coated beads may be added to a sample and the sample then applied to a magnet, 

15 thereby capturing the cells bound to the beads. The unwanted cells are washed away 
and the magnetically isolated cells eluted from the beads and used in further analyses. 

RosetteSep can be used to enrich cells directly from a blood sample and 
consists of a cocktail of tetrameric antibodies that targets a variety of unwanted cells 
and crosslinks them to glycophorin A on red blood cells (RBC) present in the sample, 

20 forming rosettes. When centrifuged over Ficoll, targeted cells pellet along with the free 
RBC. The combination of antibodies in the depletion cocktail determines which cells 
will be removed and consequently which cells will be recovered. Antibodies that are 
available include, but are not limited to: CD2, CD3, CD4, CD5, CD8, CD10, CDllb, 
CD14, CD15, CD16, CD19, CD20, CD24, CD25, CD29, CD33, CD34, CD36, CD38, 

25 CD41, CD45, CD45RA, CD45RO, CD56, CD66B, CD66e, HLA-DR, IgE, and TCRap. 

Additionally, it is contemplated in the present invention that mAbs 
specific for lung tumor antigens can be generated and used in a similar manner. For 
example, mAbs that bind to tumor-specific cell surface antigens may be conjugated to 
magnetic beads, or formulated in a tetrameric antibody complex, and used to enrich or 

30 positively select metastatic lung tumor cells from a sample. Once a sample is enriched 
or positively selected, cells may be lysed and RNA isolated. RNA may then be 
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subjected to RT-PCR analysis using lung tumor-specific primers in a real-time PCR 
assay as described herein. One skilled in the art will recognize that enriched or selected 
populations of cells may be analyzed by other methods {e.g. in situ hybridization or 
flow cytometry). 

5 In another embodiment, the compositions described herein may be used 

as markers for the progression of cancer. In this embodiment, assays as described above 
for the diagnosis of a cancer may be performed over time, and the change in the level of 
reactive polypeptide(s) or polynucleotide(s) evaluated. For example, the assays may be 
performed every 24-72 hours for a period of 6 months to 1 year, and thereafter 

10 performed as needed. In general, a cancer is progressing in those patients in whom the 
level of polypeptide or polynucleotide detected increases over time. In contrast, the 
cancer is not progressing when the level of reactive polypeptide or polynucleotide either 
remains constant or decreases with time. 

Certain in vivo diagnostic assays may be performed directly on a tumor. 

15 One such assay involves contacting tumor cells with a binding agent. The bound 
binding agent may then be detected directly or indirectly via a reporter group. Such 
binding agents may also be used in histological applications. Alternatively, 
polynucleotide probes may be used within such applications. 

As noted above, to improve sensitivity, multiple tumor protein markers 

20 may be assayed within a given sample. It will be apparent that binding agents specific 
for different proteins provided herein may be combined within a single assay. Further, 
multiple primers or probes may be used concurrently. The selection of tumor protein 
markers may be based on routine experiments to determine combinations that results in 
optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided 

25 herein may be combined with assays for other known tumor antigens. 

The present invention, further provides kits for use within any of the 
above diagnostic methods. Such kits typically comprise two or more components 
necessary for performing a diagnostic assay. Components may be compounds, reagents, 
containers and/or equipment. For example, one container within a kit may contain a 

30 monoclonal antibody or fragment thereof that specifically binds to a tumor protein. 
Such antibodies or fragments may be provided attached to a support material, as 
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described above. One or more additional containers may enclose elements, such as 
reagents or buffers, to be used in the assay. Such kits may also, or alternatively, contain 
a detection reagent as described above that contains a reporter group suitable for direct 
or indirect detection of antibody binding. 
5 Alternatively, a kit may be designed to detect the level of mRNA 

encoding a tumor protein in a biological sample. Such kits generally comprise at least 
one oligonucleotide probe or primer, as described above, that hybridizes to a 
polynucleotide encoding a tumor protein. Such an oligonucleotide may be used, for 
example, within a PCR or hybridization assay. Additional components that may be 
10 present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to facilitate the detection of a polynucleotide encoding a tumor protein. 

The following examples are offered by way of illustration and not by 
way of limitation. 

EXAMPLES 

15 EXAMPLE 1 

Isolation and Characterization of cDNA Sequences Encoding 
Lung Tumor Polypeptides 

This example illustrates the isolation of cDNA molecules encoding lung 
tumor-specific polypeptides from lung tumor cDNA libraries. 

20 A. ISOLATION OF CDNA SEQUENCES FROM A LUNG SQUAMOUS CELL 
CARCINOMA LIBRARY 

A human lung squamous cell carcinoma cDNA expression library was 
constructed from poly A + RNA from a pool of two patient tissues using a Superscript 
Plasmid System for cDNA Synthesis and Plasmid Cloning kit (BRL Life Technologies, 
25 Gaithersburg, MD) following the manufacturer's protocol. Specifically, lung carcinoma 
tissues were homogenized with polytron (Kinematica, Switzerland) and total RNA was 
extracted using Trizol reagent (BRL Life Technologies) as directed by the manufacturer. 
The poly A + RNA was then purified using an oligo dT cellulose column as described in 
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Sambrook etal., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY, 1989. First-strand cDNA was synthesized using 
the Notf/01igo-dT18 primer. Double-stranded cDNA was synthesized, ligated with 
BstXI/EcoRI adaptors (Invitrogen, San Diego, CA) and digested with Notl. Following 
5 size fractionation with cDNA size fractionation columns (BRL Life Technologies), the 
cDNA was ligated into the BstXI/NotI site of pcDNA3.1 (Invitrogen) and transformed 
into ElectroMax E. coli DH10B cells (BRL Life Technologies) by electroporation. 

Using the same procedure, a normal human lung cDNA expression 
library was prepared from a pool of four tissue specimens. The cDNA libraries were 

10 characterized by determining the number of independent colonies, the percentage of 
clones that carried insert, the average insert size and by sequence analysis. The lung 
squamous cell carcinoma library contained 2.7 x 10 6 independent colonies, with 100% 
of clones having an insert and the average insert size being 2100 base pairs. The normal 
lung cDNA library contained 1.4 x 10 6 independent colonies, with 90% of clones 

15 having inserts and the average insert size being 1800 base pairs. For both libraries, 
sequence analysis showed that the majority of clones had a full length cDNA sequence 
and were synthesized from mRNA 

cDNA library subtraction was performed using the above lung squamous 
cell carcinoma and normal lung cDNA libraries, as described by Hara et ah {Blood, 

20 54:189-199, 1994) with some modifications. Specifically, a lung squamous cell 
carcinoma-specific subtracted cDNA library was generated as follows. Normal tissue 
cDNA library (80 jig) was digested with BamHI and Xhol, followed by a filling-in 
reaction with DNA polymerase Klenow fragment. After phenol-chloroform extraction 
and ethanol precipitation, the DNA was dissolved in 133 (-il of H2O, heat-denatured and 

25 mixed with 133 jol (133 jag) of Photoprobe biotin (Vector Laboratories, Burlingame, 
CA). As recommended by the manufacturer, the resulting mixture was irradiated with a 
270 W sunlamp on ice for 20 minutes. Additional Photoprobe biotin (67 |J.l) was added 
and the biotinylation reaction was repeated. After extraction with butanol five times, 
the DNA was ethanol-precipitated and dissolved in 23 nl H 2 0 to form the driver DNA. 

30 To form the tracer DNA, 10 |ig lung squamous cell carcinoma cDNA 

library was digested with Notl and Spel, phenol chloroform extracted and passed 
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through Chroma spin-400 columns (Clontech, Palo Alto, CA). Typically, 5 ng of 
cDNA was recovered after the sizing column. Following ethanol precipitation, the 
tracer DNA was dissolved in 5 pi H 2 0. Tracer DNA was mixed with 15 nl driver DNA 
and 20 pi of 2 x hybridization buffer (1.5 M NaCl/10 mM EDTA/50 mM HEPES pH. 
5 7.5/0.2% sodium dodecyl sulfate), overlaid with mineral oil, and heat-denatured 
completely. The sample was immediately transferred into a 68 °C water bath and 
incubated for 20 hours (long hybridization [LH]). The reaction mixture was then 
subjected to a streptavidin treatment followed by phenol/chloroform extraction. This 
process was repeated three more times. Subtracted DNA was precipitated, dissolved in 

10 12 jlxI H 2 0, mixed with 8 fxl driver DNA and 20 \x\ of 2 x hybridization buffer, and 
subjected to a hybridization ,at 68 °C for 2 hours (short hybridization [SH]). After 
removal of biotinylated double-stranded DNA, subtracted cDNA was ligated into 
Notl/Spel site of chloramphenicol resistant pBCSK + (Stratagene, La Jolla, CA) and 
transformed into ElectroMax E. coli DH10B cells by electroporation to generate a lung 

15 squamous cell carcinoma specific subtracted cDNA library (herein after referred to as 
"lung subtraction I"). 

A second lung squamous cell carcinoma specific subtracted cDNA 
library (referred to as "lung subtraction II") was generated in a similar way to the lung 
subtraction library L, except that eight frequently recovered genes from lung subtraction 

20 I were included in the driver DNA, and 24,000 independent clones were recovered. 

To analyze the subtracted cDNA libraries, plasmid DNA was prepared 
from 320 independent clones, randomly picked from the subtracted lung squamous cell 
carcinoma specific libraries. Representative cDNA clones were further characterized by 
DNA sequencing with a Perkin Elmer/Applied Biosystems Division Automated 

25 Sequencer Model 373A and/or Model 377 (Foster City, CA). The cDNA sequences for 
sixty isolated clones are provided in SEQ ID NO: 1-60. These sequences were 
compared to known sequences in the gene bank using the EMBL and GenBank 
databases (release 96). No significant homologies were found to the sequences 
provided in SEQ ID NO: 2, 3, 19, 38 and 46. The sequences of SEQ ID NO: 1, 6-8, 10- 

30 13, 15, 17, 18, 20-27, 29, 30, 32, 34-37, 39-45, 47-49, 51, 52, 54, 55 and 57-59 were 
found to show some homology to previously identified expressed sequence tags (ESTs). 



Ill 



WO 02/47534 



PCT7US01/47576 



The sequences of SEQ ID NO: 9, 28, 3 1 and 33 were found to show some homology to 
previously identified non-human gene sequences and the sequences of SEQ ID NO: 4, 
5, 14, 50, 53, 56 and 60 were found to show some homology to gene sequences 
previously identified in humans. 
5 The subtraction procedure described above was repeated using the above 

lung squamous cell carcinoma cDNA library as the tracer DNA, and the above normal 
lung tissue cDNA library and a cDNA library from normal liver and heart (constructed 
from a pool of one sample of each tissue as described above), plus twenty other cDNA 
clones that were frequently recovered in lung subtractions I and II, as the driver DNA 

10 (lung subtraction IH). The normal liver and heart cDNA library contained 1.76 x 10 6 
independent colonies, with 100% of clones having inserts and the average insert size 
being 1600 base pairs. Ten additional clones were isolated (SEQ ID NO: 61-70). 
Comparison of these cDNA sequences with those in the gene bank as described above, 
revealed no significant homologies to the sequences provided in SEQ ID NO: 62 and 

15 67. The sequences of SEQ ID NO: 61, 63-66, 68 and 69 were found to show some 
homology to previously isolated ESTs and the sequence provided in SEQ ID NO: 70 
was found to show some homology to a previously identified rat gene. 

In further studies, the subtraction procedure described above was 
repeated using the above lung squamous cell carcinoma cDNA library as the tracer 

20 DNA, and a cDNA library from a pool of normal lung, kidney, colon, pancreas, brain, 
resting PBMC, heart, skin and esophagus as the driver DNA, with esophagus cDNAs 
making up one third of the driver material. Since esophagus is enriched in normal 
epithelial cells, including differentiated squamous cells, this procedure is likely to 
enrich genes that are tumor specific rather than tissues specific. The cDNA sequences 

25 of 48 clones determined in this subtraction are provided in SEQ ID NO: 177-224. The 
sequences of SEQ ID NO: 177, 178, 180, 181, 183, 187, 192, 195-197, 208, 211, 212, 
215, 216, 218 and 219 showed some homology to previously identified genes. The 
sequences of SEQ ID NO: 179, 182, 184-186, 188-191, 193, 194, 198-207, 209 210, 
213, 214, 217, 220 and 224 showed some homology to previously determined ESTs. 

30 The sequence of SEQ ED NO: 221-223 showed no homology to any previously 
determined sequence. 
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B. ISOLATION OF cDNA SEQUENCES FROM A LUNG 
ADENOCARCINOMA LIBRARY 

A human lung adenocarcinoma cDNA expression library was 
constructed as described above. The library contained 3.2 x 10 6 independent colonies, 
5 with 100% of clones having an insert and the average insert size being 1500 base pairs. 
Library subtraction was performed as described above using the normal lung and 
normal liver and heart cDNA expression libraries described above as the driver DNA 
Twenty-six hundred independent clones were recovered. 

Initial cDNA sequence analysis from 100 independent clones revealed 

10 many ribosomal protein genes. The cDNA sequences for fifteen clones isolated in this 
subtraction are provided in SEQ ID NO: 71-86. Comparison of these sequences with 
those in the gene bank as described above revealed no significant homologies to the 
sequence provided in SEQ ID NO: 84. The sequences of SEQ ID NO: 71, 73, 74, 77, 
78 and 80-82 were found to show some homology to previously isolated ESTs, and the 

15 sequences of SEQ ID NO: 72, 75, 76, 79, 83 and 85 were found to show some 
homology to previously identified human genes. 

In further studies, a cDNA library (referred to as mets3616A) was 
constructed from a metastatic lung adenocarcinoma. The determined cDNA sequences 
of 25 clones sequenced at random from this library are provided in SEQ ID NO: 255- 

20 279. The mets3616A cDNA library was subtracted against a cDNA library prepared 
from a pool of normal lung, liver, pancreas, skin, kidney, brain and resting PBMC. To 
increase the specificity of the subtraction, the driver was spiked with genes that were 
determined to be most abundant in the mets3616A cDNA library, such as EFl-alpha, 
integrin-beta and anticoagulant protein PP4, as well as with cDNAs that were 

25 previously found to be differentially expressed in subtracted lung adenocarcinoma 
cDNA libraries. The determined cDNA sequences of 51 clones isolated from the 
subtracted library (referred to as mets3616A-Sl) are provided in SEQ ID NO: 280-330. 

Comparison of the sequences of SEQ ID NO: 255-330 with those in the 
public databases revealed no significant homologies to the sequences of SEQ ID NO: 

30 255-258, 260, 262-264, 270, 272, 275, 276, 279, 281, 287, 291, 296, 300 and 310. The 
sequences of SEQ ID NO: 259, 261, 265-269, 271, 273, 274, 277, 278, 282-285, 288- 
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290, 292, 294, 297-299, 301, 303-309, 313, 314, 316, 320-324 and 326-330 showed 
some homology to previously identified gene sequences, while the sequences of SEQ ID 
NO: 280, 286, 293, 302, 310, 312, 315, 317-319 and 325 showed some homology to 
previously isolated expressed sequence tags (ESTs). 

5 EXAMPLE 2 

Determination of Tissue Specificity of Lung Tumor Polypeptides 

Using gene specific primers, mRNA expression levels for seven 
representative lung tumor polypeptides described in Example 1 were examined in a 
variety of normal and tumor tissues using RT-PCR. 

10 Briefly, total RNA was extracted from a variety of normal and tumor 

tissues using Trizol reagent as described above. First strand synthesis was carried out 
using 2 jig of total RNA with Superscript H reverse transcriptase (BRL Life 
Technologies) at 42 °C for one hour. The cDNA was then amplified by PCR with gene- 
specific primers. To ensure the semi-quantitative nature of the RT-PCR, P-actin was 

15 used as an internal control for each of the tissues examined. 1 \x\ of 1:30 dilution of 
cDNA was employed to enable the linear range amplification of the p-actin template 
and was sensitive enough to reflect the differences in the initial copy numbers. Using 
these conditions, the P-actin levels were determined for each reverse transcription 
reaction from each tissue. DNA contamination was minimized by DNase treatment and 

20 by assuring a negative PCR result when using first strand cDNA that was prepared 
without adding reverse transcriptase. 

mRNA Expression levels were examined in five different types of tumor 
tissue (lung squamous cell carcinoma from 3 patients, lung adenocarcinoma, colon 
tumor from 2 patients, breast tumor and prostate tumor), and thirteen different normal 

25 tissues (lung from 4 donors, prostate, brain, kidney, liver, ovary, skeletal muscle, skin, 
small intestine, stomach, myocardium, retina and testes). Using a 10-fold amount of 
cDNA, the antigen LST-S1-90 (SEQ ID NO: 3) was found to be expressed at high 
levels in lung squamous cell carcinoma and in breast tumor, and at low to undetectable 
levels in the other tissues examined. 
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The antigen LST-S2-68 (SEQ ID NO: 15) appears to be specific to lung 
and breast tumor, however, expression was also detected in normal kidney. Antigens 
LST-S1-169 (SEQ ID NO: 6) and LST-S1-133 (SEQ ID NO: 5) appear to be very 
abundant in lung tissues (both normal and tumor), with the expression of these two 
5 genes being decreased in most of the normal tissues tested. Both LST-S1-169 and LST- 
Sl-133 were also expressed in breast and colon tumors. Antigens LST-S1-6 (SEQ ID 
NO: 7) and LST-S2-I2-5F (SEQ ID NO: 47) did not show tumor or tissue specific 
expression, with the expression of LST-S1-28 being rare and only detectable in a few 
tissues. The antigen LST-S3-7 (SEQ ID NO: 63) showed lung and breast tumor specific 

10 expression, with its message only being detected in normal testes when the PCR was 
performed for 30 cycles. Lower level expression was detected in some normal tissues 
when the cycle number was increased to 35. Antigen LST-S3-13 (SEQ ID NO: 66) was 
found to be expressed in 3 out of 4 lung tumors, one breast tumor and both colon tumor 
samples. Its expression in normal tissues was lower compared to tumors, and was only 

15 detected in 1 out of 4 normal lung tissues and in normal tissues from kidney, ovary and 
retina. Expression of antigens LST-S3-4 (SEQ ID NO: 62) and LST-S3-14 (SEQ ID 
NO: 67) was rare and did not show any tissue or tumor specificity. Consistent with 
Northern blot analyses, the RT-PCR results on antigen LAT-S1-A-10A (SEQ ID NO: 
78) suggested that its expression is high in lung, colon, stomach and small intestine 

20 tissues, including lung and colon tumors, whereas its expression was low or 
undetectable in other tissues. 

A total of 2002 cDNA fragments isolated in lung subtractions I, II and 
HI, described above, were colony PCR amplified and their mRNA expression levels in 
lung tumor, normal lung, and various other normal and tumor tissues were determined 

25 using microarray technology (Synteni, Palo Alto, CA). Briefly, the PCR amplification 
products were dotted onto slides in an array format, with each product occupying a 
unique location in the array. mRNA was extracted from the tissue sample to be tested, 
reverse transcribed, and fluorescent-labeled cDNA probes were generated. The 
microarrays were probed with the labeled cDNA probes, the slides scanned and 

30 fluorescence intensity was measured. This intensity correlates with the hybridization 
intensity. Seventeen non-redundant cDNA clones showed over-expression in lung 
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squamous tumors, with expression in normal tissues tested (lung, skin, lymph node, 
colon ? liver, pancreas, breast, heart, bone marrow, large intestine, kidney, stomach, 
brain, small intestine, bladder and salivary gland) being either undetectable, or 10-fold 
less compared to lung squamous tumors. The determined cDNA sequences for the 
5 clone L513S are provided in SEQ ID NO: 87 and 88; those for L514S are provided in 
SEQ ID NO: 89 and 90; those for L516S in SEQ ID NO: 91 and 92; that for L517S in 
SEQ ID NO: 93; that for L519S in SEQ ID NO: 94; those for L520S in SEQ ID NO: 95 
and 96; those for L521S in SEQ ID NO: 97 and 98; that for L522S in SEQ ID NO: 99; 
that for L523S in SEQ ID NO: 100; that for L524S in SEQ ID NO: 101; that for L525S 

10 in SEQ ID NO: 102; that for L526S in SEQ ID NO: 103; that for L527S in SEQ ID NO: 
104; that for L528S in SEQ ID NO: 105; that for L529S in SEQ ID NO: 106; and those 
for L530S in SEQ ID NO: 107 and 108. Additionally, the full-length cDNA sequence 
for L530S is provided in SEQ ID NO: 151, with the corresponding amino acid sequence 
being provided in SEQ ID NO: 152. L530S shows homology to a splice variant of a 

15 p53 tumor suppressor homologue, p63. The cDNA sequences of 7 known isoforms of 
p63 are provided in SEQ ID NO: 331-337, with the corresponding amino acid 
sequences being provided in SEQ ID NO: 338-344, respectively. 

Due to polymorphisms, the clone L531S appears to have two forms. A 
first determined full-length cDNA sequence for L53 IS is provided in SEQ ID NO: 109, 

20 with the corresponding amino acid sequence being provided in SEQ ID NO: 110. A 
second determined full-length cDNA sequence for L531S is provided in SEQ ID NO: 
111, with the corresponding amino acid sequence being provided in SEQ ID NO: 112. 
The sequence of SEQ ID NO: 1 1 1 is identical to that of SEQ ID NO: 109, except that it 
contains a 27 bp insertion. Similarly, L514S has two alternatively spliced forms; the 

25 first variant cDNA is listed as SEQ ID NO: 153, with the corresponding amino acid 
sequence being provided in SEQ ID NO: 155. The full-length cDNA for the second 
variant form of L514S is provided in SEQ ID NO: 154, with the corresponding amino 
acid sequence being provided in SEQ ID NO: 156. 

Full length cloning for L524S (SEQ ED NO: 101) yielded two variants 

30 . (SEQ ED NO: 163 and 164) with the corresponding amino acid sequences of SEQ ID 
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NO: 165 and 166, respectively. Both variants have been shown to encode parathyroid 
hormone-related peptide. 

Attempts to isolate the full-length cDNA for L519S, resulted in the 
isolation of the extended cDNA sequence provided in SEQ ID NO: 173, which contains 
5 a potential open reading frame. The amino acid sequence encoded by the sequence of 
SEQ ID NO: 173 is provided in SEQ ID NO: 174. Additionally, the full-length cDNA 
sequence for the clone of SEQ ID NO: 100 (known as L523S), a known gene, is 
provided in SEQ ID NO: 175, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 176. In further studies, a full-length cDNA sequence for 

10 L523S was isolated from a L523S-positive tumor cDNA library by PCR amplification 
using gene specific primers designed from the sequence of SEQ ID NO: 175. The 
determined full-length cDNA sequence is provided in SEQ ID NO: 347. The amino 
acid sequence encoded by this sequence is provided in SEQ ID NO: 348. This protein 
sequence differs from the previously published protein sequence at two amino acid 

1 5 positions, namely at positions 1 58 and 410. 

Comparison of the sequences of L514S and L531S (SEQ ID NO: 87 and 
88, and 109, respectively) with those in the gene bank, as described above, revealed no 
significant homologies to known sequences. The sequences of L513S, L516S, L517S, 
L519S, L520S and L530S (SEQ ID NO: 87 and 88, 91 and 92, 93, 94, 95 and 96, 107 

20 and 108, respectively) were found to show some homology to previously identified 
ESTs. The sequences of L521S, L522S, L523S, L524S, L525S, L526S, L527S, L528S 
and L529S (SEQ ID NO: 97 and 98, 99, 99, 101, 102, 103, 104, 105, and 106, 
respectively) were found to represent known genes. The determined full-length cDNA 
sequence for L520S is provided in SEQ ID NO: 113, with the corresponding amino 

25 acid sequence being provided in SEQ ID NO: 114. Subsequent microarray analysis 
showed L520S to be overexpressed in breast tumors in addition to lung squamous 
tumors. 

Further analysis demonstrated that L529S (SEQ ID NO: 106 and 115), 
. L525S (SEQ ID NO: 102 and 120) and L527S (SEQ ID NO: 104) are cytoskeletal 
30 components and potentially squamous cell specific proteins. L529S is connexin 26, a 
gap junction protein. It was found to be highly expressed in one lung squamous tumor, 
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referred to as 9688T, and moderately over-expressed in two others. However, lower 
level expression of connexin 26 is also detectable in normal skin, colon, liver and 
stomach. The over-expression of connexin 26 in some breast tumors has been reported 
and a mutated form of L529S may result in over-expression in lung tumors. L525S is 
5 plakophilin 1, a desmosomal protein found in plaque-bearing adhering junctions of the 
skin. Expression levels for L525S mRNA was highly elevated in three out of four lung 
squamous tumors tested, and in normal skin. L527S has been identified as keratin 6 
isoform, type II 58 Kd keratin and cytokeratin 13, and shows over-expression in 
squamous tumors and low expression in normal skin, breast and colon tissues. Keratin 
10 and keratin-related genes have been extensively documented as potential markers for 
lung cancer including CYFRA2.1 (Pastor, A., et al, Eur Respir. J., 10:603-609, 1997). 
L513S (SEQ ID NO: 87 and 88) shows moderate over-expression in several tumor 
tissues tested, and encodes a protein that was first isolated as a pemphigus vulgaris 
antigen. 

15 L520S (SEQ ID NO: 95 and 96) and L521S (SEQ ID NO: 97 and 98) are 

highly expressed in lung squamous tumors, with L520S being up-regulated in normal 
salivary gland and L521S being over-expressed in normal skin. Both belong to a family 
of small proline rich proteins and represent markers for fully differentiated squamous 
cells. L521S has been described as a specific marker for lung squamous tumor (Hu, R., 

20 et al, Lung Cancer, 20:25-30, 1998). L515S (SEQ ID NO: 162) encodes IGF-P2 and 
L516S is an aldose reductase homologue. Both are moderately expressed in lung 
squamous tumors and in normal colon. Notably, L516S (SEQ ID NO: 91 and 92) is up- 
regulated in metastatic tumors but not primary lung adenocarcinoma, an indication of its 
potential role in metatasis and a potential prognostic marker. L522S (SEQ ID NO: 99) 

25 is moderately over-expressed in lung squamous tumors with minimum expression in 
normal tissues. L522S has been shown to belong to a class IV alcohol dehydrogenase, 
ADH7, and its expression profile suggests it is a squamous cell specific antigen. L523S 
(SEQ ID NO: 100) is moderately over-expressed in lung squamous tumor, human 
pancreatic cancer cell lines and pancreatic cancer tissues, suggesting this gene may be a 

30 shared antigen between pancreatic and lung squamous cell cancer. 
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L524S (SEQ ID NO: 101) is over-expressed in the majority of squamous 
tumors tested and is homologous with parathyroid hormone-related peptide (PTHrP), 
which is best known to cause humoral hypercalcaemia associated with malignant 
tumors such as leukemia, prostate and breast cancer. It is also believed that PTHrP is 
5 most commonly associated with squamous carcinoma of lung and rarely with lung 
adenocarcinoma (Davidson, L.A., et al, J. Pathol, 178: 398-401, 1996). L528S (SEQ 
ID NO: 105) is highly over-expressed in two lung squamous tumors with moderate 
expression in two other squamous tumors, one lung adenocarcinoma and some normal 
tissues, including skin, lymph nodes, heart, stomach and lung. It encodes the NMB 

10 gene that is similar to the precursor of melanocyte specific gene Pmell7, which is 
reported to be preferentially expressed in low-metastatic potential melanoma cell lines. 
This suggests that L528S may be a shared antigen in both melanoma and lung squamous 
cell carcinoma, L526S (SEQ ED NO: 103) was overexpressed in all lung squamous cell 
tumor tissues tested and has been shown to share homology with a gene (ATM) in 

15 which a mutation causes ataxia telangiectasia, a genetic disorder in humans causing a 
predisposition to cancer, among other symptoms. ATM encodes a protein that activates 
a p53 mediated cell-cycle checkpoint through direct binding and phosphorylation of the 
p53 molecule. Approximately 40% of lung cancers are associated with p53 mutations, 
and it is speculated that over-expression of ATM is a result of compensation for loss of 

20 p53 function, but it is unknown whether over-expression is the cause of result of lung 
squamous cell carcinoma. Additionally, expression of L526S (ATM) is also detected in 
a metastatic but not lung adenocarcinoma, suggesting a role in metastasis. 

Expression of L523S (SEQ ID NO: 175), was examined by real time 
RT-PCR as described above. In a first study using a panel of lung squamous tumors, 

25 L523S was found to be expressed in 4/7 lung squamous tumors, 2/3 head and neck 
squamous tumors and 2/2 lung adenocarcinomas, with low level expression being 
observed in skeletal muscle, soft palate and tonsil. In a second study using a lung 
adenocarcinoma panel, expression of L523S was observed in 4/9 primary 
adenocarcinomas, 2/2 lung pleural effusions, 1/1 metastatic lung adenocarcinomas and 

30 2/2 lung squamous tumors, with little expression being observed in normal tissues. 
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Expression of L523S in lung tumors and various normal tissues was also 
examined by Northern blot analysis, using standard techniques. In a first study, L523S 
was found to be expressed in a number of lung adenocarcinomas and squamous cell 
carcinomas, as well as normal tonsil. No expression was observed in normal lung. In a 
5 second study using a normal tissue blot (referred to as HB-12) from Clontech, no 
expression was observed in brain, skeletal muscle, colon, thymus, spleen, kidney, liver, 
small intestine, lung or PBMC, although there was strong expression in placenta. 

EXAMPLE 3 

Isolation and Characterization of Lung Tumor Polypeptides 
10 by PCR-Based Subtraction 



Eight hundred and fifty seven clones from a cDNA subtraction library, 
containing cDNA from a pool of two human lung squamous tumors subtracted against 
eight normal human tissue cDNAs including lung, PBMC, brain, heart, kidney, liver, 
pancreas, and skin, (Clontech, Palo Alto, CA) were derived and submitted to a first 

15 round of PCR amplification. This library was subjected to a second round of PGR 
amplification, following the manufacturer's protocol. The resulting cDNA fragments 
were subcloned into the P7-Adv vector (Clontech, Palo Alto, CA) and transformed into 
DH5a E. coli (Gibco, BRL). DNA was isolated from independent clones and 
sequenced using a Perkin Elmer/Applied Biosystems Division Automated Sequencer 

20 Model 373A. 

One hundred* and sixty two positive clones were sequenced. Comparison 
of the DNA sequences of these clones with those in the EMBL and GenBank databases, 
as described above, revealed no significant homologies to 13 of these clones, hereinafter 
referred to as Contigs 13, 16, 17, 19, 22, 24, 29, 47, 49, 56-59. The determined cDNA 
25 sequences for these clones are provided in SEQ ID NO: 125, 127-129, 131-133, 142, 
144, 148-150, and 157, respectively. Contigs 1, 3-5, 7-10, 12, 1 1, 15, 20, 31, 33, 38, 39, 
41, 43, 44, 45, 48, 50, 53, 54 (SEQ ID NO: 115-124, 126, 130, 134-141, 143, 145-147, 
respectively) were found to show some degree of homology to previously identified 
DNA sequences. Contig 57 (SEQ ID NO: 149) was found to represent the clone L519S 
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(SEQ ID NO: 94) disclosed in US. Patent Application No. 09/123,912, filed July 27, 
1998. To the best of the inventors' knowledge, none of these sequences have been 
previously shown to be differentially over-expressed in lung tumors. 

mRNA expression levels for representative clones in lung tumor tissues, 
5 normal lung tissues (n=4), resting PBMC, salivary gland, heart, stomach, lymph nodes, 
skeletal muscle, soft palate, small intestine, large intestine, bronchial, bladder, tonsil, 
kidney, esophagus, bone marrow, colon, adrenal gland, pancreas, and skin (all derived 
from human) were determined by RT-PCR as described above. Expression levels using 
microarray technology, as described above, were examined in one sample of each tissue 

1 0 type unless otherwise indicated. 

Contig 3 (SEQ ID NO: 116) was found to be highly expressed in all head 
and neck squamous cell tumors tested (17/17), and expressed in the majority (8/12) of 
lung squamous tumors, (high expression in 7/12, moderate in 2/12, and low in 2/12), 
while showing negative expression for 2/4 normal lung tissues and low expression in 

15 the remaining two samples. Contig 3 showed moderate expression in skin and soft 
palate, and lowered expression levels in resting PBMC, large intestine, salivary gland, 
tonsil, pancreas, esophagus, and colon. Contig 1 1 (SEQ ID NO: 124) was found to be 
expressed in all head and neck squamous cell tumors tested (17/17), with high levels of 
expression being seen in 14/17 tumors, and moderately levels of expression being seen 

20 in 3/17 tumors. Additionally,, high expression was seen in 3/12 lung squamous tumors 
and moderate expression in 4/12 lung squamous tumors. Contig 1 1 was negative for 3/4 
normal lung samples, with the remaining sample having only low expression. Contig 
1 1 showed low to moderate reactivity to salivary gland, soft palate, bladder, tonsil, skin, 
esophagus, and large intestine. Contig 13 (SEQ ID NO: 125) was found to be expressed 

25 in all head and neck squamous cell tumors tested (17/17), with high expression in 
12/17, and moderate expression in 5/17. Contig 13 was expressed in 7/12 lung 
squamous tumors, with high expression in 4/12 and moderate expression in three 
samples. Analysis of normal lung samples showed negative expression for 2/4 and low 
to moderate expression in the remaining two samples. Contig 13 showed low to 

30 moderate reactivity to resting PBMC, salivary gland, bladder, pancreas, tonsil, skin, 
esophagus, and large intestine, as well as high expression in soft palate. Subsequent 
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full-length cloning efforts revealed that contig 13 (also known as L761P) maps to the 3' 
untranslated region of the hSeclOp gene. The full-length sequence for this gene is set 
forth in SEQ ID NO: 368, and encodes the protein set forth in SEQ ID NO: 369. 

Contig 16 (SEQ ID NO: 127) was found to be moderately expressed in 
5 several head and neck squamous cell tumors (6/17) and one lung squamous tumor, 
while showing no expression in any normal lung samples tested. Contig 16 showed low 
reactivity to resting PBMC, large intestine, skin, salivary gland, and soft palate. Contig 
17 (SEQ ID NO: 128) was shown to be expressed in all head and neck squamous cell 
tumors tested (17/17) (highly expressed in 5/17, and moderately expressed in 12/17). 

10 Determination of expression levels in lung squamous tumors showed one tumor sample 
with high expression and 3/12 with moderate levels. Contig 17 was negative for 2/4 
normal lung samples, with the remaining samples having only low expression. 
Additionally, low level expression was found in esophagus and soft palate. Contig 19 
(SEQ ID NO: 129) was found to be expressed in most head and neck squamous cell 

15 tumors tested (11/17); with two samples having high expression levels, 6/17 showing 
moderate expression, and low expression being found in 3/17. Testing in lung 
squamous tumors revealed only moderate expression in 3/12 samples. Expression 
levels in 2/4 of normal lung samples were negative, the two other samples having only 
low expression. Contig 19 showed low expression levels in esophagus, resting PBMC, 

20 salivary gland, bladder, soft palate and pancreas. 

Contig 22 (SEQ ID NO: 131), was shown to be expressed in most head 
and neck squamous cell tumors tested (13/17) with high expression in four of these 
samples, moderate expression in 6/17, and low expression in 3/17. Expression levels in 
lung squamous tumors were found to be moderate to high for 3/12 tissues tested, with 

25 negative expression in two normal lung samples and low expression in two other 
samples (n=4). Contig 22 showed low expression in skin, salivary gland and soft 
palate. Similarly, Contig 24 (SEQ ID NO: 132) was found to be expressed in most head 
and neck squamous cell tumors tested (13/17) with high expression in three of these 
samples, moderate expression in 6/17, and low expression in 4/17. Expression levels in 

30 lung squamous tumors were found to be moderate to high for 3/12 tissues tested, with 
negative expression for three normal lung samples and low expression in one sample 
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(n=4). Contig 24 showed low expression in skin, salivary gland and soft palate. 
Contig 29 (SEQ ID NO: 133) was expressed in nearly all head and neck squamous cell 
tumors tested (16/17): highly expressed in 4/17, moderately expressed in 11/17, with 
low expression in one sample. Also, it was moderately expressed in 3/12 Mng 
5 squamous tumors, while being negative for 2/4 normal lung samples. Contig 29 
showed low to moderate expression in large intestine, skin, salivary gland, pancreas, 
tonsil, heart and soft palate. Contig 47 (SEQ ID NO: 142) was expressed in most head 
and neck squamous cell tumors tested (12/17): moderate expression in 10/17, and low 
expression in two samples. In lung squamous tumors, it was highly expressed in one 
10 sample and moderately expressed in two others (n=13). Contig 47 was negative for 2/4 
normal lung samples, with the remaining two samples having moderate expression. 
Also, Contig 47 showed moderate expression in large intestine, and pancreas, and low 
expression in skin, salivary gland, soft palate, stomach, bladder, resting PBMC, and 
tonsil. 

15 Contig 48 (SEQ ID NO: 143) was expressed in all head and neck 

squamous cell tumors tested (17/17): highly expressed in 8/17 and moderately 
expressed in 7/17, with low expression in two samples. Expression levels in lung 
squamous tumors were high to moderate in three samples (n=13). Contig 48 was 
negative for one out of four normal lung samples, the remaining showing low or 

20 moderate expression. Contig 48 showed moderate expression in soft palate, large 
intestine, pancreas, and bladder, and low expression in esophagus, salivary gland, 
resting PBMC, and heart. Contig 49 (SEQ ID NO: 144) was expressed at low to 
moderate levels in 6/17 head and neck squamous cell tumors tested. Expression levels 
in lung squamous tumors were moderate in three samples (n=13). Contig 49 was 

25 negative for 2/4 normal lung samples, the remaining samples showing low expression. 
Moderate expression levels in skin, salivary gland, large intestine, pancreas, bladder and 
resting PBMC were shown, as well as low expression in soft palate, lymph nodes, and 
tonsil. Contig 56 (SEQ ID NO: 148) was expressed in low to moderate levels in 3/17 . 
head and neck squamous cell tumors tested, and in lung squamous tumors, showing low 

30 to moderate levels in three out of thirteen samples. Notably, low expression levels were 
detected in one adenocarcinoma lung tumor sample (n=2). Contig 56 was negative for 
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3/4 normal lung samples, and showed moderate expression levels in only large intestine, 
and low expression in salivary gland, soft palate, pancreas, bladder, and resting PBMC. 
Contig 58, also known as L769P, (SEQ ID NO: 150) was expressed at moderate levels 
in 11/17 head and neck squamous cell tumors tested and low expression in one 
5 additional sample. Expression in lung squamous tumors showed low to moderate levels 
in three out of thirteen samples. Contig 58 was negative for 3/4 normal lung samples, 
with one sample having low expression. Moderate expression levels in skin, large 
intestine, and resting PBMC were demonstrated, as well as low expression in salivary 
gland, soft palate, pancreas, and bladder. Contig 59 (SEQ ID NO: 157) was expressed in 

10 some head, neck, and lung squamous tumors. Low level expression of Contig 59 was 
also detected in salivary gland and large intestine. 

The full-length cDNA sequence for Contig 22, also referred to as L763P, 
is provided in SEQ ID NO: 158, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 159. Real-time RT-PCR analysis of L763P revealed that it is 

15 highly expressed in 3/4 lung squamous tumors as well as 4/4 head and neck squamous 
tumors, with low level expression being observed in normal brain, skin, soft pallet and 
trachea. Subsequent database searches revealed that the sequence of SEQ ID NO: 158 
contains a mutation, resulting in a frameshift in the corresponding protein sequence. A 
second cDNA sequence for L763P is provided in SEQ ID NO: 345, with the 

20 corresponding amino acid sequence being provided in SEQ ID NO: 346. The sequences 
of SEQ ID NO: 159 and 346 are identical with the exception of the C-terminal 33 amino 
acids of SEQ ID NO: 159. 

The fiill-length cDNA sequence incorporating Contigs 17, 19, and 24, 
referred to as L762P, is provided in SEQ ID NO: 160, with the corresponding amino 

25 acid sequence being provided in SEQ ID NO: 161. Further analysis of L762P has 
determined it to be a type I membrane protein and two additional variants have been 
sequenced. Variant 1 (SEQ ID NO: 167, with the corresponding amino acid sequence 
in SEQ ID NO: 169) is an alternatively spliced form of SEQ ID NO: 160 resulting in 
deletion of 503 nucleotides, as well as deletion of a short segment of the expressed 

30 protein. Variant 2 (SEQ ID NO: 168, with the corresponding amino acid sequence in 
SEQ ID NO: 170) has a two nucleotide deletion at the 3' coding region in comparison 
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to SEQ ID NO: 160, resulting in a secreted form of the expressed protein. Real-time 
RT-PCR analysis of L762P revealed that is over-expressed in 3/4 lung squamous 
tumors and 4/4 head & neck tumors, with low level expression being observed in 
normal skill, soft pallet and trachea. 
5 An epitope of L762P was identified as having the sequence 

KPGHWTYTLNNTHHSLQALK (SEQ ID NO: 382), which corresponds to amino 
acids 571-590 of SEQ IDNO:161. 

The full-length cDNA sequence for contig 56 (SEQ ID NO: 148), also 
referred to as L773P, is provided in SEQ ID NO: 171, with the amino acid sequence in 

10 SEQ ID NO: 172. L773P was found to be identical to dihydroxyl dehydrogenase at the 
3' portion of the gene, with divergent 5' sequence. As a result, the 69 N-terminal amino 
acids are unique. The cDNA sequence encoding the 69 N-terminal amino acids is 
provided in SEQ ID NO: 349, with the N-terminal amino acid sequence being provided 
in SEQ ID NO: 350. Real-time PCR revealed that L773P is highly expressed in lung 

15 squamous tumor and lung adenocarcinoma, with no detectable expression in normal 
tissues. Subsequent Northern blot analysis of L773P demonstrated that this transcript is 
differentially over-expressed in squamous tumors and detected at approximately 1.6 Kb 
in primary lung tumor tissue and approximately 1.3 Kb in primary head and neck tumor 
tissue. 

20 Subsequent microarray analysis has shown Contig 58, also referred to as 

L769S (SEQ ID NO: 150), to be overexpressed in breast tumors in addition to lung 
squamous tumors. 

EXAMPLE 4 

Isolation and Characterization of Lung Tumor Polypeptides 
25 by PCR-Based Subtraction 

Seven hundred and sixty clones from a cDNA subtraction library, 
containing cDNA from a pool of two human lung primary adenocarcinomas subtracted 
against a pool of nine normal human tissue cDNAs including skin* colon, lung, 
esophagus, brain, kidney, spleen, pancreas and liver, (Clontech, Palo Alto, CA) were 
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derived and submitted to a first round of PCR amplification. This library (referred to as 
ALT-1) was subjected to a second round of PCR amplification, following the 
manufacturer's protocol. The expression levels of these 760 cDNA clones in lung 
tumor, normal lung, and various other normal and tumor tissues, were examined using 
5 microarray technology (Incyte, Palo Alto, CA). Briefly, the PCR amplification products 
were dotted onto slides in an array format, with each product occupying a unique 
location in the array. mRNA was extracted from the tissue sample to be tested, reverse 
transcribed, and fluorescent-labeled cDNA probes were generated. The microarrays 
were probed with the labeled cDNA probes, the slides scanned and fluorescence 

10 intensity was measured. This intensity correlates with the hybridization intensity.. A 
total of 1 18 clones, of which 55 were unique, were found to be over-expressed in lung 
tumor tissue, with expression in normal tissues tested (lung, skin, lymph node, colon, 
liver, pancreas, breast, heart, bone marrow, large intestine, kidney, stomach, brain, small 
intestine, bladder and salivary gland) being either undetectable, or at significantly lower 

15 levels. One of these clones, having the sequence as provided in SEQ ID NO:420 (clone 
#19014), shows homology to a previously identified clone, L773P. Clone L773P has 
the full-length cDNA sequence provided in SEQ ID NO: 171 and the amino acid 
sequence provided in SEQ ID NO:172 The isolation of clone #19014 is also described 
in co-pending U.S. Patent application 09/285,479, filed April 2, 1999. 

20 EXAMPLES 

Synthesis of Polypeptides 

Polypeptides may be synthesized on a Peririn Elmer/Applied Biosystems 
Division 430A peptide synthesizer using FMOC chemistry with HPTU (O- 
Benzotriazole-N 3 NJSf' ? N'-tetramethyluroniiim hexafluorophosphate) activation. A Gly- 
25 Cys-Gly sequence may be attached to the amino terminus of the peptide to provide a 
method of conjugation, binding to an immobilized surface, or labeling of the peptide. 
Cleavage of the peptides from the solid support is carried out using the following 
cleavage mixture: trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol 
(40:1:2:2:3). After cleaving for 2 hours, the peptides are precipitated in cold methyl-t- 
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butyl-ether. The peptide pellets are then dissolved in water containing 0.1% 
trifluoroacetic acid (TFA) and lyophilized prior to purification by CI 8 reverse phase 
HPLC. A gradient of 0%-60% acetonitrile (containing 0.1% TFA) in water (containing 
0.1% TFA) may be used to elute the peptides. Following lyophilization of the pure 
5 fractions, the peptides are characterized using electrospray or other types of mass 
spectrometry and by amino acid analysis. 

EXAMPLE 6 

Preparation of Antibodies Against Lung Cancer Antigens 

Polyclonal antibodies against the lung cancer antigens L514S, L528S, 
10 L531S, L523 and L773P (SEQ ID NO: 155, 225, 112, 176 and 171, respectively) were 
prepared as follows. 

Rabbits were immunized with recombinant protein expressed in and 
purified from E coli as described below. For the initial immunization, 400 ^g of 
antigen combined with muramyl dipeptide (MDP) was injected subcutaneously (S.C.). 
15 Animals were boosted S.C. 4 weeks later with 200 jxg of antigen mixed with incomplete 
Freund's Adjuvant (IFA). Subsequent boosts of 100 \ig of antigen mixed with IFA 
were injected S.C. as necessary to induce high antibody titer responses. Serum bleeds 
from immunized rabbits were tested for antigen-specific reactivity using ELISA assays 
with purified protein. Polyclonal antibodies against L514S, L528S, L531S, L523S and 
20 L773P were affinity purified from high titer polyclonal sera using purified protein 
attached to a solid support. 

Immunohistochemical analysis using polyclonal antibodies against 
L514S was performed on a panel of 5 lung tumor samples, 5 normal lung tissue samples 
and normal colon, kidney, liver, brain and bone marrow. Specifically, tissue samples 
25 were fixed in formalin solution for 24 hours and embedded in paraffin before being 
sliced into 10 micron sections. Tissue sections were permeabilized and incubated with 
antibody for 1 hr. HRP-labeled anti-mouse followed by incubation with DAB 
chromogen was used to visualize L514S immunoreactivity. L514S was found to be 
highly expressed in lung tumor tissue with little or no expression being observed in 
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normal lung, brain or bone marrow. Light staining was observed in colon (epithelial 
crypt cells positive) and kidney (tubules positive). Staining was seen in normal liver but 
no mRNA has been detected in this tissue making this result suspect 

Using the same procedure, immunoMstochemical analysis using 
5 polyclonal antibodies against L528S demonstrated staining in lung tumor and normal 
lung samples, light staining in colon and kidney, and no staining in liver and heart. 

Immunohistochemical analysis using polyclonal antibodies against 
L531S demonstrated staining in lung tumor samples, light membrane staining in most 
normal lung samples, epithelial staining in colon, tubule staining in kidney, ductal 
1 0 epithelial staining in liver and no staining in heart. 

Immunohistochemical analysis using polyclonal antibodies against 
L523S demonstrated staining in all lung cancer samples tested but no staining in normal 
lung, kidney, liver, colon, bone marrow or cerebellum. 

Generation of polyclonal anti-sera against L762P (SEQ ID NO: 169 and 
15 170) was performed as follows. 400 micrograms of lung antigen was combined with 
100 micrograms of muramyldipeptide (MDP). An equal volume of Incomplete 
Freund's Adjuvant (IFA) was added and then mixed until an emulsion was formed. 
Rabbits were injected subcutaneously (S.C.). After four weeks the animals were 
injected S.C. with 200 micrograms of antigen mixed with an equal volume of IFA. 
20 Every four weeks animals were boosted with 100 micrograms of antigen. Seven days 
following each boost the animal was bled. Sera was generated by incubating the blood 
at 4°C for 12-24 hours followed by centrifugation. 

Characterization of polyclonal antisera was carried out as follows. 
Ninety-six well plates were coated with antigen by incubing with 50 microliters 
25 (typically 1 microgram) at 4°C for 20 hrs. 250 microliters of BSA blocking buffer was 
added to the wells and incubated at room temperature for 2 hrs. Plates were washed 6 
times with PBS/0.01% Tween. Rabbit sera was diluted in PBSand 50 microliters of 
diluted sera was added to each well and incubated at room temperature for 30 min. 
. Plates were washed as described above before addition of 50 microliters of goat anti- 
30 rabbit horse radish peroxidase (HRP) at a 1:10000 dilution and incubation at room 
temperature for 30 min. Plates were washed as described above and lOOfxl of TMB 
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Microwell Peroxidase Substrate was added to each well. Following a 15 minute 
incubation in the dark at room temperature, the colorimetric reaction was stopped with 
100|il IN H2SO4 and read immediately at 450 nm. Antisera showed strong reactivity to 
antigen L762P. 

5 Immunohistochemical analysis using polyclonal antibodies against 

L762P demonstrated staining in all lung cancer samples tested, some light staining in 
the bronchiole epithelium of normal lung, tubule staining in kidney, light epithelial 
staining in colon and no staining in heart or liver. 

In order to evaluate L773P protein expression in various tissues, 

10 immunohistochemistry (IHC) analysis was performed using an affinity purified L773P 
polyclonal antibody. Briefly, tissue samples were fixed in formalin solution for 12-24 
hrs and embedded in paraffin before being sliced into 8 micron sections. Steam heat 
induced epitope retrieval (SHIER) in 0.1 M sodiuym citrate buffer (pH 6.0) was used 
for optimal staining conditions. Sections were incubated with 10% serum/PBS for 5 

15 minutes. Primary antibody was added to each section for 25 minutes at indicated 
concentrations followed by 25 minute incubation with either anti-rabbit or anti-mouse 
biotinylated antibody. Endogenous peroxidase activitiy was blocked by three 1 .5 minute 
incubations with hydrogen peroxidase. The avidin biotin complex/horse radish 
peroxidase (ABC/HRP) system was used along with DAB chromogen to visualize 

20 L773P expression. Slides were counterstainied with hematoxylin to visualize cell 
nuclei. Using this approach, L773P protein was detected in 6/8 lung tumors, 4/6 normal 
lung samples (very light staining in some cases), 1/1 kidney samples (very light 
staining), 0/1 heart samples, 1/1 colon samples (very light staining) and 0/1 liver 
samples. 

25 EXAMPLE 7 

Peptide Priming of Mice and Propagation of CTL Lines 

Immunogenic peptides from the lung cancer antigen L762P (SEQ ID 
NO: 161) for HLA-A2/K b -restricted CD8+ T cells were identified as follows. 
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The location of HLA-A2 binding peptides within the lung cancer antigen 
L762P (SEQ ID NO: 161) was predicted using a computer program which predicts 
peptides sequences likely to being to HLA-A*0201 by fitting to the known peptide 
binding motif for HLA-A*0201 (Rupert et al (1993) Cell 74:929; Rammensee et al 
5 (1995) Immunogenetics ' 47:178-228). A series of 19 synthetic peptides corresponding 
to a selected subset of the predicted HLA-A*0201 binding peptides was prepared as 
described above. 

Mice expressing the transgene for human HLA A2/K b (provided by Dr L. 
Sherman, The Scripps Research Institute, La Jolla, CA) were immunized with the 

10 synthetic peptides, as described by Theobald et al., Proc. Natl Acad Set USA 
92:1 1993-1 1997, 1995, with the following modifications. Mice were immunized with 
SO^g of L726P peptide and 120|ig of an I-A b binding peptide derived from hepatitis B 
virus protein emulsified in incomplete Freund's adjuvant. Three weeks later these mice 
were sacrificed and single cell suspensions prepared. Cells were then resuspended at 7 

15 x 10 6 cells/ml in complete media (RPMI-1640; Gibco BRL, Gaithersburg, MD) 
containing 10% FCS, 2mM Glutamine (Gibco BRL), sodium pyruvate (Gibco BRL), 
non-essential amino acids (Gibco BRL), 2 x 10* 5 M 2-mercaptoethanol, 50U/ml 
penicillin and streptomycin, and cultured in the presence of irradiated (3000 rads) 
L762P peptide- (5p,g/ml) and lOmg/ml B 2 -microglobulin- (3 ng/ml) LPS blasts (A2 

20 transgenic spleens cells cultured in the presence of 7(ig/ml dextran sulfate and 25|ig/ml 
LPS for 3 days). After six days, cells (5 x 10 5 /ml) were restimulated with 2.5 x 10 6 /ml 
peptide-pulsed irradiated (20,000 rads) EL4A2Kb cells (Sherman et al, Science 
255:815-818, 1992) and 5 x 10 6 /ml irradiated (3000 rads) A2/K b -transgenic spleen 
feeder cells. Cells were cultured in the presence of lOU/ml IL-2. Cells were 

25 restimulated on a weekly basis as described, in preparation for cloning the line. 

Peptide-specific cell lines were cloned by limiting dilution analysis with 
irradiated (20,000 rads) L762P peptide-pulsed EL4 A2Kb tumor cells (1 x 10 4 
cells/well) as stimulators and irradiated (3000 rads) A2/K b -transgenic spleen cells as 
feeders (5 x 10 5 cells/ well) grown in the presence of lOU/ml IL-2. On day 7, cells were 

30 restimulated as before. On day 14, clones that were growing were isolated and 
maintained in culture. 



130 



WO 02/47534 



PCT7US01/47576 



Cell lines specific for the peptides L762P-87 (SEQ ID NO: 226; 
corresponding to amino acids 87-95 of SEQ ID NO: 161), L762P-145 (SEQ ID NO: 
227; corresponding to amino acids 145-153 of SEQ ID NO: 161), L762P-585 (SEQ ID 
NO: 228; corresponding to amino acids 585-593 of SEQ ED NO: 161), L762P-425 
5 (SEQ ID NO: 229; corresponding to amino acids 425-433 of SEQ ID NO: 161), 
L762P(10)-424 (SEQ ID NO: 230; corresponding to amino acids 424-433 of SEQ ID 
NO: 161) and L762P(10)-458 (SEQ ID NO: 231; corresponding to amino acids 458-467 
of SEQ ID NO: 161) demonstrated significantly higher reactivity (as measured by 
percent specific lysis) against L762P peptide-pulsed EL4-A2/K b tumor target cells than 
1 0 control peptide-pulsed EL4-A2/K b tumor target cells. 

EXAMPLE 8 

Identification of CD4 Immunogenic T Cell Epitopes Derived 
From the Lung Cancer Antigen L762p 

CD4 T cell lines specific for the antigen L762P (SEQ ID NO: 161) were 

15 generated as follows. 

A series of 28 overlapping peptides were synthesized that spanned 
approximately 50% of the L762P sequence. For priming, peptides were combined into 
pools of 4-5 peptides, pulsed at 20 micrograms/ml into dendritic cells for 24 hours. The 
dendritic cells were then washed and mixed with positively selected CD4+ T cells in 96 

20 well U-bottomed plates. Forty cultures were generated for each peptide pool. Cultures 
were restimulated weekly with fresh dendritic cells loaded with peptide pools. 
Following a total of 3 stimulation cycles, cells were rested for an additional week and 
tested for specificity to antigen presenting cells (APC) pulsed with peptide pools using 
interferon-gamma ELISA and proliferation assays. For these assays, adherent 

25 monocytes loaded with either the relevant peptide pool or an irrelevant peptide were 
used as APC. T cell lines that appeared to specifically recognize L762P peptide pools 
both by cytokine release and proliferation were identified for each pool. Emphasis was 
placed on identifying T cells with proliferative responses. T cell lines that demonstrated 
either both L762P-specific cytokine secretion and proliferation, or strong proliferation 
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alone were further expanded to be tested for recognition of individual peptides from the 
pools, as well as for recognition of recombinant L762P. The source of recombinant 
L762P was R coli, and the material was partially purified and endotoxin positive. 
These studies employed 10 micrograms of individual peptides, 10 or 2 micrograms of 
5 an irrelevant peptide, and 2 or 0.5 micrograms of either L762P protein or an irrelevant, 
equally impure, R coli generated recombinant protein. Significant interferon-gamma 
production and CD4 T cell proliferation was induced by a number of L762P-derived 
peptides in each pool. The amino acid sequences for these peptides are provided in 
SEQ ID NO: 232-251. These peptides correspond to amino acids 661-680, 676-696, 

10 526-545, 874-893, 811-830, 871-891, 856-875, 826-845, 795-815, 736-755, 706-725, 
706-725, 691-710, 601-620, 571-590, 556-575, 616-635, 646-665, 631-650, 541-560 
and 586-605, respectively, of SEQ ID NO: 161. 

CD4 T cell lines that demonstrated specificity for individual L762P- 
derived peptides were further expanded by stimulation with the relevant peptide at 10 

15 micrograms/ml. Two weeks post-stimulation, T cell lines were tested using both 
proliferation and IFN-gamma ELISA assays for recognition of the specific peptide. A 
number of previously identified T cells continued to demonstrate L762P-peptide 
specific activity. Each of these lines was further expanded on the relevant peptide and, 
following two weeks of expansion, tested for specific recognition of the L762P-peptide 

20 in titration experiments, as well as for recognition of recombinant E. co/z-derived L762P 
protein. For these experiments, autologous adherent monocytes were pulsed with either 
the relevant L762P-derived peptide, an irrelevant mammaglobin-derived peptide, 
recombinant R a>/z-derived L762P (approx. 50% pure), or an irrelevant R co/z-derived 
protein. The majority of T cell lines were found to show low affinity for the relevant 

25 peptide, since specific proliferation and IFN-gamma ratios dramatically decreased as 
L762P peptide was diluted. However, four lines were identified that demonstrated 
significant activity even at 0.1 micrograms/ml peptide. Each of these lines (referred to 
as A/D5, D/F5, E/A7 and E/B6) also appeared to specifically proliferate in response to 
the £ co/z-derived L762P protein preparation, but not in response to the irrelevant 

30 protein preparation. The amino acid sequences of the L762P-derived peptides 
recognized by these lines are provided in SEQ ID NO: 234, 249, 236 and 245, 
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respectively. No protein specific IFN-gamma was detected for any of the lines. Lines 
A/D5, E/A7 and E/B6 were cloned on autologous adherent monocytes pulsed with the 
relevant peptide at 0.1 (A/D5 and E/A7) or 1 (D/F5) microgram/ml. Following growth, 
clones were tested for specificity for the relevant peptide. Numerous clones specific for 
5 the relevant peptide were identified for lines A/D5 and E/A7. 

EXAMPLE 9 

Protein Expression of Lung Tumor-Specific Antigens 

a) Expression of L514S inl?. colt 

The lung tumor antigen L514S (SEQ ID NO: 89) was subcloned into the 
10 expression vector pE32b at Ncol and NotI sites, and transformed into E coli using 
standard techniques. The protein was expressed from residues 3-153 of SEQ ID NO: 
89. The expressed amino acid sequence and the corresponding DNA sequence are 
provided in SEQ ID NO: 252 and 253, respectively. 

b) Expression of L762P 

15 Amino acids 32-944 of the lung tumor antigen L762P (SEQ ID NO: 161), with a 6X His 
Tag, were subcloned into a modified pET28 expression vector, using kanamycin 
resistance, and transformed into BL21 CodonPlus using standard techniques. Low to 
moderate levels of expression were observed. The determined DNA sequence of the 
L762P expression construct is provided in SEQ ID NO: 254. 

20 EXAMPLE 10 

Identification of MHC class n restricting Allele for L762P Peptide-Specific 

Responses 

A panel of HLA mismatched antigen presenting cells (APC) were used 
to identify the MHC class II restricting allele for the L762P-peptide specific responses 
25 of CD4 T cell clones derived from lines that recognized L762P peptide and recombinant 
protein. Clones from two lines, AD-5 and EA-7, were tested as described below. The 
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IS 

AD-5 derived clones were found to be restricted by the HLA-DRB-1 101 allele, and an 
EA-7 derived clone was found to be restricted by the HLA DRB-0701 or DQB1-0202 
allele. Identification of the restriction allele allows targeting of vaccine therapies using 
the defined peptide to individuals that express the relevant class II allele. Knowing the 
5 relevant restricting allele will also enable clinical monitoring for responses to the 
defined peptide since only individuals that express the relevant allele will be monitored. 

CD4 T cell clones derived from line AD-5 and EA-7 were stimulated on 
autologous APC pulsed with the specific peptide at 10 j^g/ml, and tested for recognition 
of autologous APC (from donor D72) as well as against a panel of APC partially 

10 matched with D72 at class II alleles. Table 2 shows the HLA class typing of the APC 
tested. Adherent monocytes (generated by 2 hour adherence) from four different 
donors, referred to as D45, D187, D208, and D326, were used as APC in these 
experiments. Autologous APC were not included in the experiment. Each of the APC 
were pulsed with the relevant peptide (5a for AD-5 and 3e for 3A-7) or the irrelevant 

15 mammoglobin peptide at 10 (ig/ml, and cultures were established for 10,000 T cells and 
about 20,000 APC/well. As shown in Table 3, specific proliferation and cytokine 
production could be detected only when partially matched donor cells were used as 
APC. Based on the MHC typing analysis, these results strongly suggest that the 
restricting allele for the L762-specific response of the AD-5 derived clones is HLA- 

20 DRB-1 101 and for the EA-7 derived clone the restricting allele is HLA DRB-0701 or 
DQB1-0202. 



Table 2 - HLA Typing of APC 
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EXAMPLE 11 

Fusion Proteins of N-Terminal and C-Terminal Portions of L763P 

In another embodiment, a Mycobacterium tuberculosis-derived 
polynucleotide, referred to as Ral2, is linked to at least an immunogenic portion of a 
polynucleotide of this invention. Ral2 compositions and methods for their use in 
enhancing expression of heterologous polynucleotide sequences are described in U.S. 
Patent Application 60/158,585, the disclosure of which is incorporated herein by 
reference in its entirety. Briefly, Ral2 refers to a polynucleotide region that is a 
subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. MTB32A is a 
serine protease of 32 KD molecular weight encoded by a gene in virulent and avirulent 
strains of M. tuberculosis. The nucleotide sequence and amino acid sequence of 
MTB32A have been described (for example, U.S. Patent Application 60/158,585; see 
also, Skeiky et al 9 Infection and Immun. (1999) 67:3998-4007, incorporated herein by 
reference). Surprisingly, it was discovered that a 14 KD C-terminal fragment of the 
MTB32A coding sequence expresses at high levels on its own and remains as a soluble 
protein throughout the purification process. Moreover, this fragment may enhance the 
immunogenicity of heterologous antigenic polypeptides with which it is fused. This 14 
KD C-terminal fragment of the MTB32A is referred to herein as Ral2 and represents a 
fragment comprising some or all of amino acid residues 192 to 323 of MTB32A. 

Recombinant nucleic acids which encode a fusion polypeptide 
comprising a Ral2 polypeptide and a heterologous lung tumor polypeptide of interest, 
can be readily constructed by conventional genetic engineering techniques. 
Recombinant nucleic acids are constructed so that, preferably, a Ral2 polynucleotide 
sequence is located 5' to a selected heterologous lung tumor polynucleotide sequence. 
It may also be appropriate to place a Ral2 polynucleotide sequence 3' to a selected 
heterologous polynucleotide sequence or to insert a heterologous polynucleotide 
sequence into a site within a Ral2 polynucleotide sequence. 

In addition, any suitable polynucleotide that encodes a Ral2 or a portion 
or other variant thereof can be used in constructing recombinant fusion polynucleotides 
comprising Ral2 and one or more lung tumor polynucleotides disclosed herein. 
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Preferred Ral2 polynucleotides generally comprise at least about 15 consecutive 
nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, at least about 
100 nucleotides, at least about 200 nucleotides, or at least about 300 nucleotides that 
encode a portion of a Ral2 polypeptide. 

Ral2 polynucleotides may comprise a native sequence (i.e., an 
endogenous sequence that encodes a Ral2 polypeptide or a portion thereof) or may 
comprise a variant of such a sequence. Ral2 polynucleotide variants may contain one 
or more substitutions, additions, deletions and/or insertions such that the biological 
activity of the encoded fusion polypeptide is not substantially diminished, relative to a 
fusion polypeptide comprising a native Ral2 polypeptide. Variants preferably exhibit at 
least about 70% identity, more preferably at least about 80% identity and most 
preferably at least about 90% identity to a polynucleotide sequence that encodes a native 
Ral2 polypeptide or a portion thereof . 

Two specific embodiments of fusions between Ral2 and antigens of the 
present invention are described in this example. 

A. N-Terminal Portion of L763P 

A fusion protein of full-length Ral2 and the N-tenninal portion of L763P (referred to as 
L763P-N; amino acid residues 1-130 of SEQ ID NO: 159) was expressed as a single 
recombinant protein in E. coli. The cDNA for the N-terminal portion was obtained by 
PCR with a cDNA for the full length L763P and primers L763F3 (5' 
CGGCGAATTCATGGATTGGGGGACGCTGC; SEQ ID NO: 383) and 1763RV3 (5' 
CGGCCTCGAGTCACCCCTCTATCCGAACCTTCTGC; SEQ ID NO: 384). The 
PCR product with expected size was recovered from agarose gel, digested with 
restriction enzymes EcoRI and Xhol, and cloned into the corresponding sites in the 
expression vector pCRXl. The sequence for the fusion of full-length of Ral2 and 
L763P-N was confirmed by DNA sequencing. The determined cDNA sequence is 
provided in SEQ ID NO:351, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 352). 
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B. C-Terminal Portion of L763P 

A fusion protein of full-length Ral2 and the C-tenninal portion of L763P 
(referred to as L763P-C; amino acid residues 100-262 of SEQ ID NO: 159) was 
expressed as a single recombinant protein in E. colu The cDNA of the C-terminal 
portion of L763P was obtained by PCR with a cDNA for the full length of L763P and 
primers L763F4 (5' CGGCGAATTCCACGAACCACTCGCAAGTTCAG; SEQ ID 
NO: 385) and L763RV4 (5* CGGCTCGAG-TTAGCTTGGGCCTGTGATTGC; SEQ 
ID NO: 386). The PCR product with expected size was recovered from agarose gel, 
digested with restriction enzymes EcoRI and Xhol, and cloned into the corresponding 
sites in the expression vector pCRXl. The sequence for the fusion of full-length Ral2 
and L763P-C was confirmed by DNA sequencing. The determined DNA sequence is 
provided in SEQ ID NO:353, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 354. 

The recombinant proteins described in this example are useful for the 
preparation of vaccines, for antibody therapeutics, and for diagnosis of lung tumors. 

EXAMPLE 12 
Expression in E Col/ of L762P His Tag Fusion Protein 

PCR was performed on the L762P coding region with the following 

primers: 

Forward primer starting at amino acid 32. 

PDM-278 5'ggagtacagcttcaagacaatggg 3' (SEQ ID NO:355) Tm 57°C. 
Reverse primer including natural stop codon after amino acid 920, 
creating EcoRI site 

PDM-280 5'ccatgggaattcattataataattttgttcc 3' (SEQ ID NO:356) 

TM55°C. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction en2ymes. The 
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correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 (DE3) pLys S and BL21 (DE3) CodonPlus RIL expression hosts. 

The protein sequence of expressed recombinant L762P is shown in SEQ 
ID NO:357, and the DNA sequence is shown in SEQ ID NO:358. 

EXAMPLE 13 

Expression in K Coli of a L773PA His Tag Fusion Protein 

The L773PA coding region (encoding amino acids 2-71 of SEQ ID NO: 
1 72) was PCR amplified using the following primers: 

Forward primer for L773PA starting at amino acid 2: 

PDM-299 5'tggcagcccctcttcttcaagtggc 3' (SEQ ID NO:359) Tm63°C. 

Reverse primer for L773PA creating artificial stop codon after amino 

acid 70: 

PDM-355 5'cgccagaattcatcaaacaaatctgttagcacc 3 ' (SEQ ID NO:360) 

Tm62°C. 

The resulting PCR product was digested with EcoRI restriction enzyme, 
gel purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and transformed into BL21 
(DE3) pLys S and BL21 (DE3) CodonPlus REL expression hosts. 

The protein sequence of expressed recombinant L773PA is shown in 
SEQ ID NO:361, and the DNA sequence is shown in SEQ ID NO:362. 

EXAMPLE 14 

Identification of Epitopes Derived From Lung Tumor Specific Polypeptides 

A series of peptides from the L773P amino acid sequence (SEQ ID NO: 
172) were synthesized and used in in vitro priming experiments to generate peptide- 
specific CD4 T cells. These peptides were 20-mers that overlapped by 15 amino acids 
and corresponded to amino acids 1-69 of the L773P protein. This region has been 
demonstrated to be tumor-specific. Following three in vitro stimulations, CD4 T cell 
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lines were identified that produced EFNy in response to the stimulating peptide but not 
the control peptide. Some of these T cell lines demonstrated recognition of recombinant 
L773P and L773PA (tumor-specific region) proteins. 

To perform the experiments, a total of eleven 20-mer peptides (SEQ ID 
NO: 363, 365 and 387-395) overlapping by 15 amino acids and derived from the N- 
tenrdnal tumor-specific region of L773P (corresponding to amino acids 1-69 of SEQ ID 
NO: 172) were generated by standard procedures. Dendritic cells were derived from 
PBMC of a normal donor using GMCSF and IL-4 by standard protocol. Purified CD4 T 
cells were generated from the same donor as the dendritic cells using MACS beads and 
negative selection of PBMCs. Dendritic cells were pulsed overnight with the individual 
20-mer peptides at a concentration of 10 jig/ml. Pulsed dendritic cells were washed and 
plated at 1 x 10 4 /well of a 96-well U-bottom plates, and purified CD4 cells were added 
at 1 x 10 5 well. Cultures were supplemented with 10 ng/ml DL-6 and 5 ng/ml DL-12, and 
incubated at 37°C. Cultures were re-stimulated as above on a weekly basis using as 
APC dendritic cells generated and pulsed as above, supplemented with 5 ng/ml EL-7 and 
10 ng/ml LL-2. Following 3 in vitro stimulation cycles, cell lines (each corresponding to 
one well) were tested for cytokine production in response to the stimulating peptide vs. 
an irrelevant peptide. 

A small number of individual CD4 T cell lines (9/528) demonstrated 
cytokine release (IFNy) in response to the stimulating peptide but not to control peptide. 
The CD4 T cell lines that demonstrated specific activity were restimulated on the 
appropriate L773P peptide and reassayed using autologous dendritic cells pulsed with 
10 ng/ml of the appropriate L773P peptide, an irrelevant control peptide, recombinant 
L773P protein (amino acids 2-364, made in E. coli), recombinant L773PA (amino acids 
2-71, made in E coli), or an appropriate control protein (L3E, made in E. coli). Three 
of the nine lines tested (1-3C, 1-6G, and 4-12B) recognized the appropriate L773P 
peptide as well as recombinant L773P and L773PA. Four of the lines tested (4-8A, 4- 
8E, 4-12D, and 4-12E) recognized the appropriate L773P peptide only. Two of the 
lines tested (5-6F and 9-3B) demonstrated non-specific activity. 

These results demonstrate that the peptide sequences 
MWQPLFFKWLLSCCPGSSQI (amino acids 1-20 of SEQ ID NO: 172; SEQ ID 
NO:363) and GSSQIAAAASTQPEDDDSfTQ (amino acids 16-35 of SEQ ID NO: 172; 
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SEQ ID NO: 365) may represent naturally processed epitopes of L773P, which are 
capable of stimulating human class II MHC-restricted CD4 T cell responses. 

In subsequent studies, the above epitope mapping experiment was 
repeated using a different donor. Again, some of the resulting T cell lines were found to 
respond to peptide and recombinant protein. An additional peptide was found to be 
naturally processed. Specifically, purified CD4 cells were stimulated on a total of 
eleven 20-mer peptides overlapping by 15 amino acids (SEQ ID NO: 363, 387, 388, 365 
and 389-395, respectively). The priming was carried out as described above, except that 
a peptide concentration of 0.5 ug/mL rather than 10 ug/mL was employed. In the initial 
screen of the cell lines 9 of the 528 lines released at least a three-fold greater level of 
IFN-gamma with stimulating peptide vs. control peptide. These 9 lines were 
restimulated on the appropriate peptide and then tested on dendritic cells pulsed with a 
titration of appropriate peptide (10 ug/mL, 1 ug/mL and 0.1 ug/mL), and 10 ug/mL of a 
control peptide. Six of the 9 lines recognized recombinant L773P as well as peptide. 
The six lines referred to as ME, 1-2E, 1-4H, 1-6A, 1-6G and 2-12B recognized 
L773PA and the appropriate peptide. These results demonstrate that the peptides of 
SEQ ID NO: 363 and 387 represent naturally processed epitopes of L773P. 

Using the procedures described above, CD4+ T cell responses were 
generated from PBMC of normal donors using dendritic cells pulsed with overlapping 
20-mer peptides (SEQ ID NO: 396-419) spanning the L523S polypeptide sequence 
(SEQ ID NO: 176). A number of CD4+ T cells demonstrated reactivity with the 
priming peptides as well as with L523S recombinant protein, with the dominant 
reactivity of these lines being within the peptides 4, 7 and 21 (SEQ ID NO: 399, 402 
and 416; coiresponding to amino acids 30-39, 60-79 and 200-219, respectively, of SEQ 
ID NO: 176). 

Epitopes within the scope of the invention include epitopes restricted by 
other class II MHC molecules. In addition, variants of the peptide can be produced 
wherein one or more amino acids are altered such that there is no effect on the ability of 
the peptides to bind to MHC molecules, no effect on their ability to elicit T cell 
responses, and no effect on the ability of the elicited T cells to recognize recombinant 
protein. 
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EXAMPLE 15 

Surface Expression of L762P AND Antibody Epitopes Thereof 

Rabbits were immunized with full-length histidine-tagged L762P protein 
generated in E. coll Sera was isolated from rabbits and screened for specific 
recognition of L762P in ELISA assays. One polyclonal serum, referred to as 2692L, was 
identified that specifically recognized recombinant L762P protein. The 2692L anti- 
L762P polyclonal antibodies were purified from the serum by affinity purification using 
L762P affinity columns. Although L762P is expressed in a subset of primary lung tumor 
samples, expression appears to be lost in established lung tumor cell lines. Therefore, 
to characterize surface expression of L762P, a retrovirus construct that expresses L762P 
was used to transduce primary human fibroblasts as well as 3 lung tumor cell lines (522- 
23, HTB, and 343T). Transduced lines were selected and expanded to examine L762P 
surface expression by FACS analysis. For this analysis, non-transduced and transduced 
cells were harvested using cell dissociation medium, and incubated with 10-50 
micrograms/ml of either affinity purified anti-L762P or irrelevant antisera. Following a 
30 minute incubation on ice, cells were washed and incubated with a secondary, FITC 
conjugated, anti rabbit IgG antibody as above. Cells were washed, resuspended in buffer 
with Propidium Iodide (PI) and examined by FACS using an Excalibur fluorescence 
activated cell sorter. For FACS analysis, Pi-positive (i.e. dead/permeabilized cells) were 
excluded. The polyclonal anti-L762P sera specifically recognized and bound to the 
surface of L762P-transduced cells but not the non-transduced counterparts. These 
results demonstrate that L762P is localized to the cell surface of both fibroblasts as well 
as lung tumor cells. 

To identify the peptide epitopes recognized by 2692L, an epitope 
mapping approach was pursued. A series of overlapping 19-21 mers (5 amino acid 
overlap) was synthesized that spanned the C terminal portion of L762P (amino acids 
481-894 of SEQ ID NO: 161). In an initial experiment peptides were tested in pools. 
Specific reactivity with the L762P antiserum was observed with pools A, B, C, and E. 
To identify the specific peptides recognized by the antiserum, flat bottom 96 well 
microtiter plates were coated with individual peptides at 10 microgram/ml for 2 hours at 
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37 °C. Wells were then aspirated and blocked with phosphate buffered saline containing 
5% (w/v) milk for 2 hours at 37 °C, and subsequently washed in PBS containing 0.1% 
Tween 20 (PBST). Purified rabbit anti-L762P serum 2692L was added at 200 or 20 
ng/well to triplicate wells in PBST and incubated overnight at room temperature. This 
was followed by washing 6 times with PBST and subsequently incubating with HRP- 
conjugated donkey anti rabbit IgG (H+L)Affinipure F(ab') fragment at 1:2,000 for 60 
minutes. Plates were then washed, and incubated in tetramethyl benzidine substrate. 
Reactions were stopped by the addition of IN sulfuric acid and plates were read at 
450/570 run using an ELISA plate reader. 

The resulting data, presented in Table 4 below, demonstrates that the 
L762P antisera recognized at least 6 distinct peptide epitopes from the 3 f half of L762P. 
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Table 4 



ELISA activity (OD 450-570 



Peptide (starting amino 
acid of L762P) 


pool 


200 ng polyclonal 
serum 


20 ng polyclonal 
serum 


: ■■■ . a (48i) %■ 


a, v 


;i>76 : 


' - l.o • • 


B (495) 


A 


0.14 


.06 






: -wo: '•'•<"« 


omw^'- 


D(526) 


E 


0.11 


0.09 


E (541) 


A 


0.11 


0.04 


F(556) 


A 


0.04 


0.02 


G(571) 


A 


0.06 


0.02 


H(586) 


B 


0.1 


0.03 


;o V' : n-»)l) 




" : •"#%q.25^^rH 




J (616) 


B 


0.1 


0.03 


K(631) 


E 


0.1 


0.08 






■' * 




M (661) 


B 


0.14 


0.03 


N(676) 


C 


0.12 


0.1 


••• • ;; : • ' m(691)Al;l : ff: - / 




. LI 


'-'0.23 y.z^ 


P(706) 


C 


0.1 


0.03 


0 (721) 


C 


0.11 


0.05 


R(736) 


E 


0.12 


0.04 


S (751) 


C 


0.15 


0.06 


U (781) 


D 


0.12 


0.06 


V (795) 


F 


0.07 


0.05 


X(826) 


D 


0.1 


0.03 


Y (841) 


D 


0.17 


0.07 


Z(856) 


D 


0.16 


0.08 


AA (871) 


F 


0.17 


0.05 


•. .•:-^:kBB.(S74y^ ,, $ 


< • 




: ?•..:*: ; > -^' 


No peptide 




0.15 


0.045 



Individual peptides were identified from each of the pools, and 
additionally a weak reactivity was identified with peptide BB from pool F. The relevant 
peptide epitopes are summarized in the Table 5 below The amino acid sequences for 
peptides BB, O, L, I, A and C are provided in SEQ ID NO: 376-381, respectively, with 
the corresponding cDNA sequences being provided in SEQ ID NO: 373, 370, 372, 374, 
371 and 375, respectively. 
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Table 5 
ELIS A activity 
(OD 450-570) 



Peptide 


Nucleotides 
ofL762P 


Amino 
acids of 
L762P 


Sequence 


pool 


200 ng 


20 ng 


A 


1441-1500 


481-500 


SRISSGTGDIFQQHIQLEST 


A 


1.76 


1.0 


C 


1531-1590 


511-530 


KNTVTVDNTVGNDTMFLVTW 


E 


0.47 


0.18 


I 


1801-1860 


601-620 


AVPPATVEAFVERDSLHFPH 


B 


025 


0.06 


L 


1936-1955 


646-665 


PETGDPVTLRLLDDGAGADV 


B 


0.28 


0.12 


O 


2071-2130 


691-710 


VNHSPSISTPAHSIPGSHAM1L 


C 


LI 


0.23 


BB 


2620-2679 


874-893 


LQSAVSNIAQAPLFIPPNSD 


F 


0.14 


0.11 


None 










0.15 


0.05 



EXAMPLE 16 

Detection of Antibodies Against Lung Tumor Antigens in Patient Sera 

Antibodies specific for the lung tumor antigens L773PA (SEQ ID 
NO:361), L514S (SEQ ID NO:155 and 156), L523S (SEQ ID NO:176), L762P (SEQ ID 
NO:161) and L763P (SEQ ID NO:159) were shown to be present in effusion fluid or 
sera of lung cancer patients but not in normal donors. More specifically, the presence of 
antibodies against L773PA, L514S, L523S, L762P and L763P in eflusion fluid obtained 
from lung cancer patients and in sera from normal donors was detected by ELISA using 
recombinant proteins and HRP-conjugated anti-human Ig. Briefly, each protein (100 
ng) was coated in 96-well plate at pH 9.5. In parallel, BSA (bovine serum albumin) was 
also coated as a control protein. The signals ([S], absorbance measured at 405 nm) 
against BSA ([N]) were determined. The results of these studies are shown in Table 6, 
wherein - represents [S]/[N] < 2; +/- represents [S]/[N] >2; ++ represents [S]/[N] >3; 
and +++ represents [S]/[N] >5. 
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Table 6 



Detection of Antibodies Against Lung Tumor Antigens 





L514S 


L523S 


L762P 


L763P 


L773PA 


Effusion fluid 












#1 


+++ 


++ 


++ 


- 


++ 


#2 


' - 


- 


+/- 


++ 


+/- 


• #3 


- 


- 


- 


- 


+/- 


I #4 


+/- 


++ 


+/- 


- 


+/- 


#5 


+/- 


+++ 


+/- 


+/- 


++ ; 


#7 


- 


+/- 


- 


- 


+/- 


#8 


- 


+++ 


- 


- 


++ 


#10 


- 


++ 


+/- 


+/- 


- 


#11 


+/- 


++ 


++ 


- 


++ 


#12 


+++ 


+/- 


- 


+/- 


+/- 


#13 


- 


+/- 


- 


- 


+/- 


#14 


- 


+++ 


+/- 




++ 


#15 


+/- 


++ 


+/- 




++ 


#17 




+/- 






+/- 


#18 




++ . 








#19 




+/- 






+/- 


#20 


+/- 


+/- 


+/- 




+/- 


Normal sera 












#21 




+/- 








#22 ' 












#23 










+/- 


#24 




+/- 








#25 


+/- 


+/- 






+/- 



Using Western blot analyses, antibodies against L523S were found to be 
present in 3 out of 4 samples of efiusion fluid from lung cancer patients, with no L523S 
antibodies being detected in the three samples of normal sera tested. 



EXAMPLE 17 

Expression in R Coli of a L5 14S His Tag Fusion Protein 

PCR was performed on the L514S-13160 coding region with the 
following primers: 

Forward primer PDM-278 5' cacactagtgtccgcgtggcggcctac 3' (SEQ ID 
NO:421)Tm67°C. 
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Reverse primer PDM-280 5' catgagaattcatcacatgcccttgaaggctccc 3' 
(SEQ ID NO:422) TM 66°C. 

The PCR conditions were as follows: 

10^1 10XP& buffer 

l%110mMdNTPs 

2.0jxl 10[M each primer 

83|il sterile water 

1 .5fil Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50rigDNA 

96°C for 2 minutes, 96°C for 20 seconds, 66°C for 15 seconds, 72°C for 
1 minute with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 CodonPlus (Stratagene, La Jolla, CA) cells for expression. 

The amino acid sequence of expressed recombinant L514S is shown in 
SEQ ID NO:423, and the DNA coding region sequence is shown in SEQ ID NO:424. 

EXAMPLE 18 

Expression in E Con of a L523S His Tag Fusion Protein 
PCR was performed on the L523S coding region with the following 

primers: 

Forward primer PDM-414 5' aacaaactgtatatcggaaacctcagcgagaa 3' (SEQ 
IDNO:425)Tm62 0 C. 

Reverse primer PDM-415 5' ccatagaattcattacttccgtcttgactgagg 3' (SEQ 
IDNO:426)TM62°C. 

The PCR conditions were as follows: 
lOfillOXPfu buffer 
l.OpllOmMdNTPs 
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2.0|il lOpM each primer 
83jil sterile water 

L5|d Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50rigDNA 

96°C for 2 minutes, 96°C for 20 seconds, 62°C for 15 seconds, 72°C for 
4 minutes with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 CodonPlus (Stratagene, La Jolla, CA) cells for expression. 

The amino acid sequence of expressed recombinant L523S is shown in 
SEQ ID NO:427, and the DNA coding region sequence is shown in SEQ ID NO:428. 

EXAMPLE 19 

Expression in e. Coli of a L762P A His Tag Fusion Protein 

PCR was performed on the L762PA coding region (L762PA is missing 
the signal sequence, the C-terminal transmembrane domain and the cytoplasmic tail) 
with the following primers: 

Forward primer PDM-278 5'ggagtacagcttcaagacaatggg 3' (SEQ ID 
NO:355)Tm57°C. 

Reverse primer PDM-279 5'ccatggaattcattatttcaatataagataatctc 3' (SEQ 
IDNO:429)TM56°C 

The PCR conditions were as follows: 
lO^illOX Pfu buffer 
1.0^1 lOmMdNTPs 
2.0jj,1 IO^iM each primer 
83|Ld sterile water 

1 .5(il Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50iigDNA 
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96°C for 2 minutes, 96°C for 20 seconds, 55°C for 15 seconds, 72°C for 
5 minutes with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 pLys S (Novagen, Madison, WI) cells for expression. 

The amino acid sequence of expressed recombinant L762PA is shown in 
SEQ ID NO:430, and the DNA coding region sequence is shown in SEQ ID NO:43 1 . 

EXAMPLE 20 

Expression in el Coli of a L773P His Tag Fusion Protein 
PCR was performed on the L773P coding region with the following 

primers: 

Forward primer PDM-299 5* tggcagcccctcttcttcaagtggc 3' (SEQ ID 
NO:359)Tm63°C. 

Reverse primer PDM-300 5' cgcctgctcgagtcattaatattcatcagaaaatgg 3' 
(SEQ ID NO:432) TM 63°C. 

The PCR conditions were as follows: 
lO^illOXPfu buffer 
LOjxl lOmMdNTPs 
2.0pl lOpM each primer 
83jil sterile water 

1.5jxl Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50r|g DNA 

96°C for 2 minutes, 96°C for 20 seconds, 63°C for 15 seconds, 72°C for 
2 minutes 15 seconds with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
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correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 pLys S (Novagen, Madison, WI) and BL21 CodonPlus (Stratagene, La Jolla, CA) 
cells for expression. 

The amino acid sequence of expressed recombinant L773P is shown in 
SEQ ID NO:433, and the DNA coding region sequence is shown in SEQ ID NO:434. 

EXAMPLE 21 
Cloning and Sequencing of a T-Cell Receptor Clone 
for the Lung Specific Antigen L762P 

T cell receptor (TCR) alpha and beta chains from a CD4 T cell clone 
specific for the lung specific antigen L762P were cloned and sequence. Basically, total 
mRNA from 2 X 10 6 cells from CTL clone 4H6 was isolated using Trizol reagent and 
cDNA was synthesized using Ready-to go kits (Pharmacia). To determine Valpha and 
Vbeta sequences of this clone, a panel of Valpha and Vbeta subtype specific primers 
was synthesized and used in RT-PCR reactions with cDNA generated from each of the 
clones. The RT-PCR reactions demonstrated that each of the clones expressed a 
common Vbeta sequence that corresponded to the Vbeta8 subfamily and a Valpha 
sequence that corresponded to the Valpha8 subfamily. To clone the full TCR alpha and 
beta chains from clone 4H6, primers were designed that spanned the initiator and 
terminator-coding TCR nucleotides. The primers were as follows: 

forward primer for TCR Valpha8 5' 

ggatccgccgccaccatgacatccattcgagctgta 3' (SEQ ID NO:435; has a BamHI site 
inserted); 

Kozak reverse primer for TCR Valpha8 (antisense) 5' 
gtcgactcagctggaccacagccgcag 3 5 (SEQ ID NO:436; has a Sail site inserted plus 
the TCR alpha constant sequence); 

forward primer for TCR Vbeta8 (sense) 5' 
ggatccgccgccaccatggactcctggaccttctgct 3' (SEQ ID NO:437; has a BamHI site 
inserted); and 
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Kozak reverse primer for TCR Vbeta 5' gtcgactcagaaatcctttctcttgac 3 5 
(SEQ ID NO:438; has a Sail site inserted plus the TCR beta constant sequence). 
Standard 35 cycle RT-PCR reactions were established using the cDNA synthesized 
from the CTL clone and the above primers utilizing the proofreading thermostable 
polymerase, PWO (Roche). The resultant PCR band, about 850 bp for Valpha and 
about 950 for Vbeta, was ligated into a PCR blunt vector (Invitrogen) and transformed 
into E. colu E. colt transformed with plasmids having full-length alpha and beta chains 
were identified. Large scale preparations of the corresponding plasmids were 
generated, and these plasmids were sequenced. The Valpha sequence (SEQ ID 
NO:439) was shown by nucleotide sequence alignment to be homologous to Valpha8.1, 
while the Vbeta sequence (SEQ ID NO:440) was shown by nucleotide sequence 
alignment to be homologous to Vbeta8.2. 

EXAMPLE 22 

Recombinant Expression of Full Length L762P in Mammalian Cells 

Full length L762P cDNA was subcloned into the mammalian expression 
vectors VR1012 and pCEP4 (Invitrogen). Both expression vectors had previously been 
modified to contain a FLAG epitope tag. These constructs were transfected into 
HEK293 and CHL-1 cells (ATCC) using Lipofectamine 2000 reagent (Gibco). Briefly, 
both the HEK and CHL-1 cells were plated at a density of 100,000 cells/ml in DMEM 
(Gibco) containing 10% FBS (Hyclone) and grown overnight The following day, 4|Ltl 
of Lipofectamine 2000 was added to lOOjxl of DMEM containing no FBS and incubated 
for 5 minutes at room temperature. The Lipofectamine/DMEM mixture was then added 
to ljxg of L762P Flag/pCEP4 or L762P Flag/VR1012 plasmid DNA resuspended in 
100|il DMEM and incubated for 15 minutes at room temperature. The 
Lipofectamine/DNA mix was then added to the HEK293 and CHL-1 cells and 
incubated for 48-72 hours at 37°C with 7% C0 2 . Cells were rinsed with PBS, then 
collected and pelleted by centrifugation. L672P expression was detected in the 
transfected HEK293 and CHL-1 cell lysates by Western blot analysis and was detected 
on the surface of transfected HEK cells by flow cytometry analysis. 
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For Western blot analysis, whole cell lysates were generated by 
incubating the cells in Triton-XlOO containing lysis buffer for 30 minutes on ice. 
Lysates were then cleared by centrifugation at 10,000 rpm for 5 minutes at 4°C. 
Samples were diluted with SDS-PAGE loading buffer containing beta-mercaptoethanol, 
then boiled for 10 minutes prior to loading the SDS-PAGE gel. The protein was 
transferred to nitrocellulose and probed using 1 jig/ml purified anti-L762P rabbit 
polyclonal sera (lot #690/73) or non-diluted anti-L762P mAb 153.20.1 supernatant. 
Blots were revealed using either goat anti-rabbit Ig coupled to HRP or goat anti-mouse 
Ig coupled to HRP followed by incubation in ECL substrate. 

For flow cytometric analysis, cells were washed further with ice cold 
staining buffer (PBS+1%BSA +Azide). Next, the cells were incubated for 30 minutes 
on ice with lOug/ml of purified anti-L762P polyclonal sera (lot #690/73) or a 1:2 
dilution of anti-L762P mAb 153.20.1 supernatant. The cells were washed 3 times with 
staining buffer and then incubated with a 1:100 dilution of goat anti-rabbit Ig(H+L)- 
FITC or goat anti-mouse Ig(H+L)-FITC reagent (Southern Biotechnology) for 30 
minutes on ice. After 3 washes, the cells were resuspended in staining buffer containing 
propidium iodide (PI), a vital stain that allows for the exclusion of permeable cells, and 
analyzed by flow cytometry. 

EXAMPLE 23 

Generation of Polyclonal Antibodies to Lung Tumor Antigens 

Three lung antigens, L523S (SEQ ID NO:176), L763P (SEQ ID NO:159) 
and L763 peptide #2684 (SEQ ID NO:441), were expressed and purified for use in 
antibody generation. 

L523S and L763P were expressed in an J?, coli recombinant expression 
system and grown overnight in LB Broth with the appropriate antibiotics at 37°C in a 
shaking incubator. The next morning, 10 ml of the overnight culture was added to 500 
ml of 2x YT with the appropriate antibiotics in a 2L-baffled Erlenmeyer flask. When 
the optical density of the culture reached 0.4-0.6 at 560 nanometers, the cells were 
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induced with IPTG (1 mM): Four hours after induction with IPTG, the cells were 
harvested by centrifiigation. 

The cells were then washed with phosphate buffered saline and 
centrifuged again. The supernatant was discarded and the cells were either frozen for 
future use or immediately processed. Twenty milliliters of lysis buffer was added to the 
cell pellets and vortexed. To break open the E. coli cells, this mixture was then run 
through a french press at a pressure of 16,000 psi. The cells were then centrifuged again 
and the supernatant and pellet were checked by SDS-PAGE for the partitioning of the 
recombinant protein. 

For proteins that localized to the cell pellet, the pellet was resuspended in 
10 mM Tris pH 8.0, 1% CHAPS and the inclusion body pellet was washed and 
centrifuged again. This procedure was repeated twice more. The washed inclusion body 
pellet was solubilized with either 8M urea or 6M guanidine HC1 containing 10 mM Tris 
pH 8.0 plus 10 mM imidazole. The solubilized protein was added to 5 ml of nickel- 
chelate resin (Qiagen) and incubated for 45 minutes to 1 hour at room temperature with 
continuous agitation. 

After incubation, the resin and protein mixture was poured through a 
disposable column and the flow through was collected. The column was .then washed 
with 10-20 column volumes of the solubilization buffer. The antigen was then eluted 
from the column using 8M urea, 10 mM Tris pH 8.0 and 300 mM imidazole and 
collected in 3 ml fractions. A SDS-PAGE gel was run to determine which fractions to 
pool for further purification. 

As a final purification step, a strong anion exchange resin, in this case 
Hi-Prep Q (Biorad), was equilibrated with the appropriate buffer and the pooled 
fractions from above were loaded onto the column. Each antigen was eluted off the 
column with an increasing salt gradient. Fractions were collected as the column was 
run and another SDS-PAGE gel was run to determine which fractions from the column 
to pool. 

The pooled fractions were dialyzed against 10 mM Tris pH 8.0. The 
release criteria were purity as determined by SDS-PAGE or HPLC, concentration as 
determined by Lowry assay or Amino Acid Analysis, identity as determined by amino 
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terminal protein sequence, and endotoxin level was determined by the Limulus (LAL) 
assay. The proteins were then put in vials after filtration through a 0.22-micron filter 
and the antigens were frozen until needed for immunization. 

The L763 peptide #2684 was synthesized and conjugated to KLH and 
froze until needed for immunization. 

The polyclonal antisera were generated using 400 micrograms of each 
lung antigen combined with 100 micrograms of muramyldipeptide (MDP). An equal 
volume of Incomplete Freund's Adjuvant (IF A) was added and then mixed and injected 
subcutaneously (S.C.) into a rabbit. After four weeks, the rabbit was S.C. boosted with 
200 micrograms of antigen mixed with an equal volume of IFA. Thereafter the rabbit 
was IV. boosted with 100 micrograms of antigen. The animal was bled seven days 
following each boost. The blood was then incubated at 4°C for 12-24 hours followed 
by centrifugation to generate the sera. 

The polyclonal antisera were characterized using 96 well plates coated 
with antigen and incubated with 50 microliters (typically 1 microgram/microliter) of the 
polyclonal antisera at 4°C for 20 hours. Basically, 250 microliters of BSA blocking 
buffer was added to the wells and incubated at room temperature for 2 hours. Plates 
were washed 6 times with PBS/0. 1 % Tween. The rabbit sera were diluted in PBS/0. 1 % 
Tween/0.1%BSA. 50 microliters of diluted sera was added to each well and incubated 
at room temperature for 30 minutes. The plates were washed as described above, and 
then 50 microliters of goat anti-rabbit horseradish peroxidase (HRP) at a 1:10000 
dilution was added and incubated at room temperature for 30 minutes. 

The plates were washed as described above, and 100 microliters of TMB 
Microwell Peroxidase Substrate was added to each well. Following a 15-minute 
incubation in the dark at room temperature, the colorimetric reaction was stopped with 
100 microliters of IN H 2 S0 4 and read immediately at 450 nm. All the polyclonal 
antibodies showed immunoreactivity to the appropriate antigen.Tables 7-9 show the 
antibody reactivity of rabbit antisera in serial dilution to the three lung antigens, L523S, 
L763P and L763 peptide #2684. The first column shows the antibody dilutions. The 
columns "Pre-immune sera" indicate ELISA data for two experiments using pre- 
immune sera. These results are averaged in the fourth column. The columns "anti- 
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L523S, L763P or #2684" indicate ELISA data for two experiments using sera from 
rabbits immunized as described in this Example, using the respective antigen, referred 
to as either L523S, L763P or #2684 in the tables. 



Table 7 



Antibody 


Pre- 


Pre- 


Average 


Anti- 


Anti- 


Average 


dilution 


immune 


immune 




L523S 


L523S 








sera (1) 


sera (2) 




(1) 


(2) 




1 


:1000 


0.14 


0.14 


0.14 


2.36 


2.37 


2.37 


1 


:2000 


0.12 


I 0.10 


0.11 


2.29 


2.23 


2.26 


1 


:4000 


0.10 


0.09 


0.10 


2.11 


2.17 


2.14 


1 


:8000 


0.09 


0.09 


0.09 


1.98 


2.00 


1.99 


1 


:16000 


0.09 


0.09 


0.09 


1.73 


1.76 


1.75 


1 


:32000 


0.09 


0.09 


0.09 


1.35 


1.40 


1.37 


1 


:64000 


0.09 


0.11 


0.10 


0.94 


0.98 


0.96 


1 


: 128000 


0.09 


0.08 


0.08 


0.61 


0.61 


0.61 


1 


:256000 


0.08 


0.08 


0.08 


0.38 


0.38 


0.38 


1 


:512000 


0.09 


0.08 


0.08 


0.24 


0.25 • 


0.25 


1 


:1024000 


0.08 


0.08 


0.08 


0.17 


0.17 


0.17 


1 


:2048000 


0.08 


0.08 


0.08 


0.14 


0.13 


0.13 


Table 8 


Antibody 


Pre- 


Pre- 


Average 


Anti- 


Anti- 


Average 


dilution 


inunune 


immune 




L763P 


L763P 








sera (1) 


sera (2) 




(1) 


(2) 




1 


:1000 


0.09 


0.11 


0.10 


1.97 


1.90 


1.93 


1 


2000 


0.07 


0.07 


0.07 


1.86 


1.84 


1.85 


1 


:4000 


0.06 


0.06 


0.06 


1.82 


1.81 


1.81 


1 


8000 


0.06 


0.06 


0.06 


1.83 


1.81 


1.82 


1: 


16000 


0.06 


0.05 


0.06 


1.79 


1.74 


1.76 


1: 


32000 


0.06 


0.06 


0.06 


1.56 


1.51 


1.53 


1: 


64000 


0.06 


0.05 


0.05 


1.35 


1.34 


1.35 


1: 


128000 


0.05 


0.05 


0.05 


1.01 


0.98 


0.99 


1: 


256000 


0.06 


0.05 


0.05 


0.69 


0.70 


0.70 


1: 


512000 


0.06 


0.05 


0.05 


0.47 


0.44 


0.46 


1: 


1024000 


0.06 


0.05 


0.06 


0.27 


0.27 


0.27 


1: 


2048000 


0.05 


0.05 


0.05 


0.16 


0.15 


0.16 
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Table 9 



Antibody 


Pre- 


Pre- 


Average 


Anti- 


Anti- 


Average 


dilution 


immune 


immune 




#2684 


#2684 








sera (1) 


sera (2) 




(1) 


(2) 




1 


.1000 


0.07 


0.07 


0.07 


2.10 


2.00 


2.05 


1 


:2000 


0.07 


0.06 


0.06 


1.95 


1.96 


1.95 


1 


4000 


0.06 


0.06 


0.06 


1.77 


1.82 


1.79 


1 


:8000 


0.06 


0.06 


0.06 


1.79 


1.81 


1.80 


1 


:16000 


0.06 


0.06 


0.06 


1.54 


1.50 


1.52 


1 


.32000 


0.06 


0.06 


0.06 


1.27 


1.20 


1.24 


1 


:64000 


0.06 


0.06 


0.06 


0.85 


0.82 


0.83 


0 


0.06 


0.06 


0.06 


0.06 


0.06 


0.06 



Tables 10-12 show the affinity purification of the respective antibodies 
to the three lung antigens, L523S, L763P and L763 peptide #2684. 



Table 10 



Antibody 
cone, 
(ug/ml) 


Affinity 
pure 
(salt 
peak) 


Affinity 
pure 
(salt 
peak) 


Average 


Affinity 
pure 
(acid 
peak) 


Affinity 
pure 
(acid 
peak) 


Average 


1.0 


2.38 


2.35 


2.36 


2.25 


2.31 


2.28 


0.5 


2.24 


2.22 


2.23 


2.19 


2.18 


2.18 


0.25 


2.05 


2.09 


2.07 


2.01 


2.03 


2.02 


0.13 


1.70 


1.81 


1.75 


1.74 


1.74 


1.74 


0.063 


1.44 


1.44 


1.44 


1.43 


1.38 


1.40 


0.031 


1.05 


1.05 


1.05 


0.99 


0.99 


0.99 


0.016 


0.68 


0.67 


0.68 


0.65 


0.64 


0.64 


0.0078 


0.43 


0.42 


0.42 


0.39 


0.39 


0.39 


0.0039 


0.27 


0.26 


0.27 


0.24 


0.26 


0.25 


0.0020 


0.18 


0.20 


0.19 


0.19 


0.18 


0.19 


0.0010 


0.13 


0.14 


0.13 


0.13 


0.14 


0.13 


0.00 


0.11 


0.12 


0.11 


0.10 


0.12 


0.11 
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Table 11 



Antibody 
dilution 


Affinitv 
pure 


Affinitv 
nil re 


AVvI 

• 


1 


:1000 


1.64 


1.77 


1.70 


1. 


:2000 


1.59 


1.76 


1.68 


1: 


:4000 


1.48 


1.62 


1.55 


1 


:8000 


1.35 


1.43 


1.39 


1. 


16000 


1.09 


1.19 


1.14 


1 


:32000 


0.81 


0.89 


0.85 


1 


64000 


0.55 


0.58 


0.56 


1 


:128000 


0.31 


0.35 


0.33 


1 


256000 


0.18 


0.20 


0.19 


1 


:512000 


0.11 


0.12 


0.11 


1: 


1024000 


0.07 


0.07 


0.07 


1: 


2048000 


0.06 


0.06 


0.06 



Table 12 



Antibody 
cone 
Oig/nti) 


Affinity 
pure 


Affinity 
pure 


Average 


1.0 


2.00 


2.02 


2.01 


0.5 


2.01 


1.93 


1.97 


0.25 


1.84 


1.83 


1.84 


0.13 


1.80 


1.83 


1.81 


0.06 


1.39 


1.60 


1.50 


0.03 


1.33 


1.35 


1.34 


0.02 


0.94 


0.93 


0.94 


0.00 


0.06 


0.06 


0.06 



EXAMPLE 24 
Full-Length cDNA Sequence Encoding L529S 

The isolation of a partial sequence (SEQ ID NO: 106) for lung antigen 
L529S was previously provided in Example 2. This partial sequence was used as a 
query to identify potential full length cDNA and protein sequences by searching against 
publicly available databases. The predicted full-length cDNA sequence for the isolated 
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cloned sequence of SEQ ID NO: 106 is provided in SEQ ID NO:442. The deduced 
amino acid sequence of the antigen encoded by SEQ ID NO:442 is provided in SEQ ID 
NO:443. It was previously disclosed in Example 2 that L529S shows similarity to 
connexin 26, a gap junction protein. 

EXAMPLE 25 

Expression in Megaterium of a Histidine Tag-Free L523S Fusion Protein 

PCR was performed on the L523S coding region with the following 

primers: 

Forward primer PDM-734 5' caatcaggcatgcacaacaaactgtatatcggaaac 3' 
(SEQ ID NO:444) Tm 63°C. 

Reverse primer PDM-735 5' cgtcaagatcttcattacttccgtcttgac 3' (SEQ ID 
NO:445)TM60°C. 

The PCR conditions were as follows: 
10^1 lOXPfu buffer 
l.OjxllOmMdNTPs 
2.0^1 lO^M each primer 
83jal sterile water 

1 .5\A Pfii DNA polymerase (Stratagene, La Jolla, CA) 
50r|g DNA 

96°C for 2 minutes, 96°C for 20 seconds, 62°C for 15 seconds, 72°C for 
4 minute with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with SphI and Bgin restriction enzymes, 
gel purified and then cloned into pMEG-3, which had been digested with SphI and Bgin 
restriction enzymes. The correct construct was confirmed by DNA sequence analysis 
and then transformed into Megaterium cells for expression. 

The amino acid sequence of expressed recombinant L523S is shown in 
SEQ ID NO:446, and the DNA coding region sequence is shown in SEQ ID NO:447, 
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EXAMPLE 26 

Expression in e. Coliov a Histepine Tag-Free L523S Fusion Protein 

PCR was performed on the L552S coding region with the following 

primers: 

Forward primer PDM-733 5' cgtactagcatatgaacaaactgtatatcggaaac 3' 
(SEQ ID NO:448) Tm 64°C. 

Reverse primer PDM-4 15 5' ccatagaattcattacttccgtcttgactgagg 3 5 (SEQ 
DDNO:426)TM62°C. 

The PCR conditions were as follows: 
lOnllOXPfu buffer 
LOnllOmMdNTPs 
2.0jxl lOjiM each primer 
83|iil sterile water 

l.Sjxl Pfu DNA polymerase (Stratagene, La Jolla, 1 CA) 
5(higDNA 

96°C for 2 minutes, 96°C for 20 seconds, 62°C for 15 seconds, 72°C for 
4 minute with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with Ndel and EcoRI restriction 
enzymes, gel purified and then cloned into pPDM, a modified pET28 vector, which had 
been digested with Ndel and EcoRI restriction enzymes. The correct construct was 
confirmed by DNA sequence analysis and then transformed into BLR pLys S and HMS 
174 pLys S cells for expression. 

The amino acid sequence of expressed recombinant L523S is shown in 
SEQ ID NO:449, and the DNA coding region sequence is shown in SEQ ID NO:450. 

EXAMPLE 27 

Epitope-Analysis of L514S and L523S-Specific Antibodies 

Peptides of candidate antigens can be used for the evaluation of antibody 
responses in both preclinical and clinical studies. These data allow one to further 
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confirm the antibody response against a certain candidate antigen. Protein-based ELIS A 
with and without competitive peptides and peptide-based ELIS A can be used to 
evaluate these antibody responses. Peptide ELISA is especially useful since it can 
further exclude the false positive of the antibody titer observed in protein-based ELISA 
as well as to provide the simplest assay system to test antibody responses to candidate 
antigens. In this example, data was obtained using both L514S- and L523S-peptides 
that show that individual cancer patients produce L514S- and L523S-specific 
antibodies. The L514S-specific antibodies recognize primarily the following epitope of 
L514S: 

aa86-110: LGKEVRDAKTTPEAFEKLGFPAAKE (SED ID 

NO:451). 

This epitope is the common epitope in humans. A rabbit antibody 
specific for L5 1 4S recognizes two addition epitopes of L5 1 4S : 

(1) aa21-45: KASDGDYYTLAVPMGDVPMDGISVA (SEQ ID 
NO:452) 

(2) aal21-135: PDRDVNLTHQLNPKVK (SED ID NO:453) 
It was further found that the SEQ ID NO:452 is common to both L514S 

isoforms, L514S-13160 and L514S-13166, whereas the other epitopes, SEQ ID NO:451 
and SEQ ID NO:453, are probably specific to the isoform, L514S-13160. 

The L523S-specific antibodies recognize primarily the following epitope 

ofL523S: 

aa440-460: KIAPAEAPDAKVRMVETGP (SEQ ID NO:454). 
This epitope is the common epitope in humans. A rabbit antibody 
specific for L523S recognizes two other epitopes: 

(1) aal56-175 PDGAAQQNNNPLQQPRG (SEQ ID 
NO:455) 

(2) aa326-345: RTITVKGNVETCAKAEEEIM (SED ID 
NO:456) 

In further studies, it was determined by peptide based ELISAs that eight 
additional epitopes of L523S were recognized by L523S-specific antibodies: 
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(1) aa40-59 AFVDCPDESWALKAIEALS (SEQ ID 
NO:457) 

(2) aa80-99: IRKLQIRNIPPHLQWEVLDS (SED ID 
NO:458) 

(3) aal60-179: AQQNPLQQPRGRRGLGQRGS (SEQ ID 
NO:459) 

(4) aal 80-199: DVHRKENAGAAEKSITILST (SED ID 
NO:460) 

(5) aa320-339: LYNPERTITVKGNVETCAKA (SEQ ID 
NO:461) 

(6) aa340-359: EEEIMKKIRESYENDIASMN (SED ID 
NO:462) 

(7) aa370-389: LNALGLFPPTSGMPPPTSGP (SEQ ID 
NO:463) 

(8) aa380-399: KIAPAEAPDAKVRMVETGP (SED ID 
NO:464) 

Out of these, six epitopes are common in both lung plural effusion fluid 
samples and in sera of lung patients. Of these six, SEQ ID NO:459 and SEQ ID 
NO:463 have no homology to other L523S-family proteins such as IGF-D mRNA- 
binding proteins 1 and 2. Accordingly, this indicates that these two peptides can be 
used as an assay system to determine the antibody response to L523S. 

EXAMPLE 28 

Generation of L523S-Specific CTL Lines Using In Vitro Whole-Gene Priming 



To determine if L523S is capable of generating a CD8 + T cell immune 
response, CTLs were generated using in vitro whole-gene priming methodologies with 
tumor antigen-vaccinia infected DC (Yee et al, The Journal of Immunology, 
157(9):4079-86, 1996), human CTL lines were derived that specifically recognize 
autologous fibroblasts transduced with the L552S tumor antigen, as determined by 
interferon-gamma ELISPOT analysis. Specifically, dendritic cells (DC) were 
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differentiated from Percoll-purifled monocytes derived from PBMC of normal human 
donors by plastic adherence and growing for five days in RPMI medium containing 
10% human serum, 50 ng/ml human GM-CSF and 30 ng/ml human IL-4. Following the 
five days of culture, the DC were infected overnight with a recombinant adenovirus that 
expresses L523S at a multiplicity of infection (M.O.I) of 33, 66 and 100, and matured 
overnight by the addition of 2 p,g/ml CD40 ligand. The virus was then inactivated by 
UV irradiation. In order to generate a CTL line, autologous PBMC were isolated and 
CD8+ T cells were enriched for by the negative selection using magnetic beads 
conjugated to CD4+, CD14+, CD16+, CD19+, CD34+ and CD56+ cells. CD8+ T cells 
specific for L523S were established in round bottom 96-well plates using 10,000 L523S 
expressing DCs and 100,000 CD8+ T cells per well in RPMI supplemented with 10% 
human serum, lOng/ml of IL-6 and 5ng/ml of IL-12. The cultures were restimulated 
every 7-10 days using autologous primary fibroblasts retrovirally transduced with 
L523S, and the costimulatory molecule CD80 in the presence of DL-2. The cells were 
also stimulated with IFN-gamma to upregulate MHC Class I. The media was 
supplemented with lOU/ml of DL-2 at the time of stimulation as well as on days 2 and 5 
following stimulation. Following three stimulation cycles, ten L523S specific CD8+ T 
cell lines were identified using interferon-gamma ELISPOT analysis that specifically 
produce interferon-gamma when stimulated with the L523S tumor antigen-transduced 
autologous fibroblasts, but not with a control antigen. 

One line, 6B1, was cloned using anti-CD3 and feeder cells. The clones 
were tested for specificity on L523S-transduced fibroblasts. In addition, using a panel 
of HLA-mismatched lines transduced with a vector expressing L523S and measuring 
interferon-gamma production by this CTL line in an ELISPOT assay, it was determined 
that this clone 6B1.4B8 is restricted by HLA-A0201. 

Also using transfected Cos cells, it was shown that clone 6B1.4B8 
recognizes Cos cells transfected with pcDNA3 HLA A0201/L523S in an HLA- 
restricted and antigen specific manner. 

An epitope mapping study demonstrated the clone 6B1.4B8 recognizes 
HLA-A201 LCL loaded with peptide pool 3 (a polypeptide corresponding to amino acid 
positions 33-59 of L523S. 
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A peptide pool breakdown study demonstrated that clone 6B1.4B8 
recognizes autologous B-LCL loaded with 15-mer peptides from amino acid positions 
. 37-55 of L523S, TGYAFVCPDESWALKAIE (SEQ ID NO:465). A further peptide 
breakdown study demonstrated that clone 6B1.4B8 recognizes T2 cells loaded with the 
same 15-mer peptides. 

A peptide recognition study demonstrated that clone 6B1.4B8 prefers T2 
cells loaded with the peptide FVDCPESWAL (SEQ ID NO:466) which is corresponds 
to the amino acid sequence at positions 41-51 of L523S and is encoded by the DNA 
sequence of SEQ ID NO:467. 

EXAMPLE 29 
L523S Expression in Other Human Cancers 

It was previously disclosed in Example 2 that L523S is expressed in lung 
cancers including squamous, adenocarcinoma and small cell carcinoma. To further 
evaluate the expression profile of this antigen an electronic express profiling was 
performed. This was done by searching a L523S-specific sequence against a public 
EST database. Results of this profiling indicate that L523S may also be present in 
colon adenocarcinomas, prostate adenocarcinomas, CML, AML, Burkitt's Lymphoma, 
brain tumors, retinoblastomas, ovarian tumors, teratocarcinomas, uterus myosarcomas, 
germ cell tumors as well as pancreatic and cervical tumor cell lines. 

EXAMPLE 30 

IMMUNOHISTOCHEMISTRY ANALYSIS OF L523S 

In order to determine which tissues express the lung tumor antigen 
L523S, immunohistochemistry QHC) analysis was performed on a diverse range of 
tissue types. Polyclonal antibodies specific for L523S (SEQ ID NO: 176) were generated 
as described in Example 23. IHC was performed essentially as described in Example 6. 
Briefly, tissue samples were fixed in formalin solution for 12-24 hours and embedded in 
paraffin before being sliced into 8 micron sections. Steam heat induced epitope 
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retrieval (SHEER) in 0.1 sodium citrate buffer (pH 6.0) was used for optimal staining 
conditions. Sections were incubated with 10% serum in PBS for 5 minutes. The 
primary L523S antibody was added to each section for 25 minutes followed by a. 25 
minute incubation with anti-rabbit biotinylated antibody. Endogenous peroxidase 
activity was blocked by three 1.5 minute incubations with hydrogen peroxidase. The 
avidin biotin complex/ horse radish peroxidase (ABC/HRP) system was used along with 
DAB chromogen to visualize antigen expression. Slides were counterstained with 
hematoxylin to visualize the cell nuclei. 

IHC analysis of L523S expression revealed that of the lung cancer 
tissues tested over 90% of tissue samples demonstrated high over-expression of the lung 
tumor antigen (10/1 1 adenocaricomas and 8/9 squamous). Of the normal tissues tested, 
all were negative for expression of L523S, with the exception of weak staining in 
normal bronchus, testis, liver, and trachea 

EXAMPLE 31 

Generation and Characterization of L762 Human Monoclonal 

Antibodies 

Cell supernatants from hybridoma fusions from the Xenomouse strain of 
transgenic mice were screened for ability to bind to L762P. All. results are shown in 
Table 13. The primary screen was to test monoclonal supernatants for reactivity to 
L762P by ELISA analysis using recombinant bacterial expressed protein. We next 
tested the human supernatants for reactivity to surface expressed L762P by whole cell 
ELISA using fluorimetry analysis. Specific reactivity of the humab supernatants was 
confirmed by performing FACS analysis on cells transfected with either an irrelevant 
plasmid or a plasmid expressing L762P. FI/CFI is the relative fold increase in 
fluorescence intensity (FI) of the anti-L762P humab primary antibody to irrelevant 
human primary antibody. FI/CFI/A20 is the relative fold increase in fluorescence 
intensity (FI) of the anti-L762P humab primary antibody to irrelevant human primary 
antibody over the FI of the anti-L762P mouse monoclonal antibody 153A20.1. 
FI/CFI/R690 is the relative fold increase in fluorescence intensity (FI) of the anti-L762P 



164 



WO 02/47534 



PCT/US01/47576 



humab primary antibody to irrelevant human primary antibody over the FI of the anti- 
L762P rabbit polyclonal antibody. FACS VRL762 is the percentage of cells transfected 
with plasmid expressing L762P that were positive following staining with indicated 
monoclonal antibody. FACS VR(-) is the percentage of cells transfected with irrelevant 
plasmid that were positive following staining with indicated monoclonal antibody. 
ELISA is the O.D. values of the indicated monoclonal antibody to recombinant L762P 
protein. The shaded rows in Table 13 indicate those antibodies that will be further 
cloned and characterized. 



Table 13: Human Monoclonal Antibodies Against L762P 



L762PHumAb 


FI/CFI 


FI/CFI/A20 


FI/CFI/R690 


FACSVRL762 


FACS VR (-) 


ELISA 


L762/VR1013 


R-690 


4.59 




1.00 










M-A20 


2.88 


1.00 












1.176 


0.51 


0.18 


0.11 






038 




1.178 


1.42 


0.49 


0.31 






035 




1.179 


0.47 


0.16 


0.10 






0.07 




1.180 


1.50 


0.52 


0.33 






0.26 




1.182 


1.45 


0.50 


032 






0.26 




1.183 


0.75 


0.26 


0.16 






0.24 




1.185 


0.89 


031 


0.19 






0.46 




1.186 ?! i ; 


3 AS n • . 


T.20 M 


' 0:75' -V- '• 


32.68 


: 7.14 


L22 


- l-^3 l T „ . 


1.187 


0.36 


0.13 


0.08 






0.06 






U.zo 


ft ftQ 


O.Uo 






0.23 




1.189 


0.50 


0.17 


0.11 






0.44 




1.190 


0.53 


0.18 


0.12 






0.42 




1.191 


3.12 . i 


1.08 


0.68 




17.90 : : £ : 


0.86 


1.29 i 


1.192 


1.91 


0.66 


0.42 






0.12 




U93 v'.?V' 


2.87 


mo ' ' n 


0,63 


17.82 - 5 


\ 6:43- " \ £, 


:(hl3 - 




1.194 


1.55 


0.54 


0.34 






0.28 




1.195 


0.14 


0.05 


0.03 






0.37 




1.196 


1.97 


0.68 


0.43 






0.89 


1.64 


1.197 


0.43 


0.15 


0.09 






0.08 




1.198 


0.54 


0.19 


0.12 






033 




1.199 


0.70 


0.24 


0.15 






0.40 




1.200 


2.00 


0.69 


0.44 






0.38 


1.56 


1.201 


1.62 


0.56 


0.35 






0.29 




1.202 


0.86 


0.30 


0.19 






036 




1.203 


1.56 


0.27 


0.18 






0.14 




1.204 


3.32 


0.58 


0.38 r 


24.83 


-6.60 


•0.17.9 


1-91. 


1.205 


2.13 


037 


0.25 






0.09 




1.206 


0.45 


0.08 


0.05 






0.23 




1.207 


0.60 


0.10 


0.07 






039 




1.208 


0.12 


0.02 


0.01 






036 




L209- 


15.52 


2.71 : 


1,80 


27.54 ■ 


9.54 .." 


0.16 


0.77 ' 


1.210 


0.92 


0.16 


0.11 






0.16 




1.211 


2.83 


0.49 ! 


033 






0.42 




1312 


3.40 


0.59 


.039 


21.68 


1136 :- 


0.14 


2.47 


1.213 


2.32 


0.40 


0.27 






0.38 




1.214 


0.80 


0.14 


0.09 






034 




1.215 


3.96 : 


0.69 . 


0.46 


38.87 


13.17 


033. 


1.80 


1.216 


1.26 


0.22 


0.15 






0.20 




1.217 


1.99 


0.35 


0.23 






0.26 
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L762PHumAb 


Fl/CFI 


FI/CFI/A20 


FI/CFI/R690 


FACSVRL762 


FACSVR(-) 


ELISA 


L762/VR1013 


1.218 


2.29 


0.40 


0.27 






0.10 




1.219 


0.15 


0.03 


0.02 






0.06 




1.220 
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ND 


139 


2.33 


1.95 


10.68 


0.15 


0.11 






0.16 




K96 


144.63. - 


- 2:05.^ • • 




63.24 : .:„ ' 


7.4.75 X 


?Q;75 


o.8or; j - 


R-690 


94.09 


1.33 


1.00 










M-A20 


70.64 


1.00 




























R-690 


7.59 




1.00 










M-A20 


5.33 


1.00 












1.97 


1.47 


0.28 


0.19 






037 




1*98 


3:69- 


0.69 ii 


0.49 ■ 


38.67** . 


1&57/V-. 


0;43 " 


1.69: / 


1199 ~ - 


4r32^ 


0.81 


0.57.:;,v^.. ; 


3831 


18.^76 


0,40 


i.48 


1.100 


0.22 


0.04 


0.03 






032 




1.101 


2.06 


0.39 


0.27 






0.49 




1.102 


0.23 


0.04 


0.03 






0.12 




1.103 


0.33 


0.06 


0.04 






0.28 




1.104 


0.45 


0.08 


0.06 






0.08 




1.105 


4,19 






3739/ 




0,25 


2.18 


1.106 


4.22 


0.79 


0.56 


46.24 


30.59 


1.21 


1.58 


1.107 


0.15 


0.03 


0.02 






0.06 




1.108 


0.08 


0.01 


0.01 






031 




1.109 


2.70 


0.51 


0.36 


6.5 


6 
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L762PHumAb 


FI/CFI 


FI/CFI/A20 


FI/CFI/R690 
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FACS VR (-) 


ELISA 


L762/VR1013 
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0.69 
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1.126 
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for 1.170 to 1.175 



Fl-fluorescence intensity of primary antibody 

CFI-fluorescence intensity of human irrelevant primary antibody. 

A20-mouse anti-L762P monoclonal antibody 

R690-rabbit anti-L762P affinity purified polyclonal antibody 

FACS VRL762-percent positive cells from transient transfection of VR1013/L762 expression plasmid 
FACS VR(-)-percent positive cells from transient transfection of empty VR1013 expression plasmid 
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EXAMPLE 32 

Epitope Mapping and Purification of hL523S-speqhc Antibodies 

This Example describes the purification of L523S antibodies that can 
distinguish between human and mouse L523S homologs and will likely distinguish 
between hL523S and hL523S-family members such as hIMP-1 and hIMP-2. 

L523S (full-length cDNA and amino acid sequence set forth in SEQ ID 
NO:347 and 348, respectively) is one of a family of proteins that includes MMP-1 and 
hIMP-2. The members of this family of proteins have a high degree of similarity one to 
the other and are also highly similar between species. Thus, generating antibodies that 
specifically recognize human L523S (hL523S) and not other members of the protein 
family in humans or the mouse homologs, has been problematic. However, in order to 
evaluate preclinical and clinical L523S DNA/Adenoviral vaccines by detecting the 
protein expression of L523S, human L523S-specific antibodies are critical. 

Polyclonal antibodies specific for hL523S were generated as described in 
Example 23. These antibodies were used to map epitopes. The epitope analysis 
showed 2 particular peptides of hL523S that were recognized, peptide 16/17 and peptide 
32. 

The amino acid sequences of both hL523S and mouse L523S (mL523S) 
peptide 16/17 and peptide 32 were then compared. Peptide 32/33 is identical between 
hL523S and mL523S. However, as the alignment below indicates, peptide 16/17 has 5 
amino acid differences between the human and mouse homologs (underlined). 

hL523S (16/17) (SEQ ID NO:468): 
I PDEMAAQQN PLQQPRGRRGLGQR 

mL523S (16/17) (SEQ ID NO:469): 
I PDETAAQQN P SPQLRGRRGPGQR 

Moreover, peptide-based ELISAs showed that peptide 17 is specifically 
recognized by lung cancer patient sera #197, and a homology search of peptide 17 
between human IMP (hIMP) family members shows that there is little similarity in this 
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region between family members. The hL523S peptide 17 (and 16/17) has less than 50% 
similarity to hL523S family members such as hIMP-1 and WMP-2. 

Based upon the epitope mapping of L523S-specific antibodies and the 
data from the homology search, hL523S or mL523S peptide 16/17-conjugated ligands 
were then used to purify human or mouse L523S-specific antibodies from rabbit 
polyclonal antibodies generated against hL523S protein as described in Example 23. 
The data from the antibodies purified by affinity chromatography using ligands 
conjugated with either hL523S-peptide 16/17 or mL523S-peptide 16/17 suggested that 
the affinity of antibodies specific to hL523S-peptide 16/17 is much higher than that of 
antibodies to mL523S-peptide 16/17 since they bind more strongly to hL523S-peptide 
16/17 than to mL523S-peptide 16/17. The difference in affinity between the purified 
antibodies to human and mouse L523S-peptide 16/17 was confirmed by peptide-based 
ELISA. The antibodies purified by hL523S-peptide 16/17 selectively bind to human 
L523S-peptide 16/17 but bind much less or not at all to mL523S-peptide 16/17. 

In order to further characterize the original polyclonal antibodies and 
antibodies purified by hL523S-peptide 16/17, immunoblot analysis was conducted using 
both human lung adenocarcinoma line as a source of hL523S protein and mouse whole 
body embryo (day 17 gestation) as the source of mL523S protein. This analysis showed 
that polyclonal antibodies specific for hL523S recognize hL523S protein expressed in 
the tumor cell line as well as mL523S protein expressed in whole body embryos of day 
17 gestation. However, the addition of hL523S peptide 32/33 blocks binding of 
antibodies to human and mouse L523S proteins. Thus, the crossreactivity of the 
polyclonal antibodies to mL523S protein is due to the existence of antibodies specific to 
hL523S peptide 32/33. In marked contrast, the purified antibodies specific to hL523S 
peptide 16/17 do not bind mL523S protein expressed in mice embryos but do recognize 
hL523S protein expressed in human lung adenocarcinoma cells. These data confirm the 
ELISA data using hL523S-peptide 16/17 and mL523S-peptide 16/17 described above. 

The amino acid sequence of hL523S peptide 16/17 used to purify the 
antibodies is about 60-70% similar to that of the mL523S-peptide 16/17 which is not 
recognized by hL523S-specific antibodies by Western blot analysis and peptide-based 
ELISA. The hL523S peptide 16/17 has less than 50% similarity to hL523S family 
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members such as hIMP-1 and hIMP-2. Taken together, these data suggest that it is 
highly probable that the antibodies purified by hL523S peptide 16/17 described herein 
will also distinguish hL523S protein from the other hL523S family members. 

In summary, antibodies purified with the hL523S peptide 16/17 do not 
recognize the mouse L523S homolog. The amino acid sequence of peptide 16/17 
between hL523S family members is less similar than between human and mouse 
L523S. Thus, the hL523S-specific antibodies described above can be used to 
distinguish between human and mouse L523S and between members of the hL523S 
family of proteins and can therefore be used for the accurate detection of hL523S 
protein expression in animals and humans. 

EXAMPLE 33 
In Vivo Immunogenecity of Lung Tumor Antigen l523 
This example describes two in vivo immunogenicity studies to evaluate 
the vaccination of mice with either an adenovirus containing L523 or with L523 naked 
DNA followed by a second immunization with an adenovirus containing L523. 

The first study involved the immunization of two strains of mice with 
L523 adenovirus. The C57B16 strain of mice is homozygous for HLA-type H-2 b , while 
strain B6D2(F1) is heterozygous for the HLA-type, H-2 b/d . Table 14 describes the 
initial immunization strategy employed. 



Table 14: Immunization with L523 Adenovirus alone: Experimental 
Design 



Group 


Immunization 


Strain (4/group) 


1 


10 8 PFUAdL523A 


C57BL6 


2 


10 7 PFU Ad hrGFP A 


C57BL6 


3 


10 8 PFUAdL523A 


B6D2(F1) 


4 


10 7 PFU Ad hrGFP A 


B6D2(F1) 


5 


Nai've 


C57BL6 


6 


Naive 


B6D2(F1) 



PFU=plaque forming unit; GFP=green fluorescent protein; Ad=adenovirus. 
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Mice were immunized intradermally with either 10 PFU of L523- 
adenovirus or 10 7 PFU of an irrelevant adenovirus (hrGFP). Three weeks following 
immunization, IgGl and IgG2a antibody responses to L523 were examined in all groups 
of mice. Briefly, recombinant fall length L523 (rL523) was coated onto ELISA plates 
and serum, at multiple dilutions, was added to the wells. Following a 60-minute 
incubation, the serum was washed from the wells and a secondary antibody, either 
sj>ecific for an IgGl or IgG2a was added to the plates. Both antibodies were directly 
conjugated to horseradish peroxide (HRP). The levels of L523 antibodies, either IgGl 
or IgG2a, were measured in all groups. In the C57BL6 mice, little to no L523-specific 
antibodies were detected following immunization. However, in the B6D2(F1) strain of 
mice immunized with L523 adenovirus, both IgGl and IgG2a L523-specific antibodies 
were detected at serum dilution as low as 1/1000. 

In addition to detecting L523-specific antibodies in the serum, 
interferon-gamma (TFN-y) responses were assayed from immune spleen cells following 
in vitro stimulation with rL523 protein. Briefly, spleen cells were harvested from all 
mice groups and cultured for 3 days in 96-well plates. Culture conditions included, 
media alone, 1 or lOfig/ml of rL523 protein, or 5[xg/ml of concanavalin A (Con A). 
After 3 days, the supernatants were harvested and assayed for IFN-y levels in the 
supernatants. 

Immunization with L523-adenovirus, but not an irrelevant adenovirus, 
elicited a strong IFN-y response from the spleen cells which were stimulated with 
rL523. In general, responses were stronger in the B6D2(F1) mouse strain, as evidenced 
by both a higher level of IFN-y production, as well as the fact that stimulation with a 
lower antigen concentration (ljig/ml) elicited an equally strong response as seen with 
the higher antigen concentration (lO^ig/ml). 

Finally, T cell proliferation responses were assayed from immune spleen 
cells by stimulation in vitro with rL523 protein. Briefly, spleen cells were cultured for 4 
days in 96-well plates with, media alone, 1 or lO^g/ml of rL523 protein, or Con A. The 
cultures were then pulsed with 3H-thymidine for the final 8 hours of culture. Results 
are represented as the stimulation index (SI) in the presence of antigen relative to 
stimulation with media alone. Results were consistent with those obtained in the IFN-y 
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assay. Immunization with L523-adenovirus, but not an irrelevant adenovirus, elicited a 
proliferation response in spleen cells stimulated with rL523. A strong SI (average of 
>20) was observed in spleen cells harvested from the B6D2(F1) mouse strain, with 
similar levels of proliferation observed at both protein concentrations. Little or no T 
cell proliferation was observed in the C57BL6 mouse strain. 

A second study involved the immunization of two strains of mice 
initially with L523 naked DNA followed by a second immunization with L523 
adenovirus two weeks later. The mice were harvested 3 weeks after the boost. Table 
1 5 describes the immunization regimen of the second study. 



Table 15: Immunization with L523 DNA followed by a second 
immunization with L523-Adenovirus: Experimental Design 



Group 


. Immunization 


Strain (4/group) 


1 


L523 DNA +10 8 PFU Ad L523 A 


C57BL6 


2 


10 8 PFUAdL523A 


C57BL6 


3 


Irrelevant DNA + 10 7 PFU Ad hrGFP A 


C57BL6 


4 


10 7 PFU Ad hrGFP A 


C57BL6 


5 


Naive 


C57BL6 


6 


L523 DNA +10 8 PFU Ad L523 A 


B6D2(F1) 


7 


10 8 PFU Ad L523 A 


B6D2(F1) 


8 


Irrelevant DNA + 10 7 PFU Ad hrGFP A 


B6D2(F1) 


9 


10 7 PFU Ad hrGFP A 


B6D2(F1) 


10 


Naive 


B6D2(F1) 



PFU=plaque forming unit; GFP=green fluorescent protein; Ad^adenovirus. 

As described in the first study, strong IgGl and IgG2a antibody 



responses were observed in B6D2(F1) mice following immunization with L523- 
adenovirus. Immunizing with L523 DNA appeared to increase the overall L523- 
specific antibody response compared to responses achieved with immunization with 
L523-adenovirus alone. C57BL6 mice elicited little or no L523-specific antibody 
responses following immunization with L523-adenovirus, but were some slightly 
positive responses were detected in mice immunized with L523 DNA followed by a 
second immunization with L523-adenovirus. 



176 



WO 02/47534 



PCT/US01/47576 



IFN-y responses were assayed from immune spleen cells by stimulation 
in vitro with rL523 protein. These results confirm those observed in the initial study 
demonstrating the immunogenecity of L523 in animals. The results also suggest that 
initially immunizing the animals with L523 DNA, prior to immunization with L523- 
adeonvirus, does not significantly increase the CD4 response. As with the.initial study, 
responses appear to be stronger in the B6D2(F1) strain of mice than the C57BL6 strain. 

As with the initial study, T cell proliferation responses were assayed 
from immune spleen cells by stimulation in vitro with rL523 protein. The results from 
using two rounds of immunization are consistent with those obtained from the first 
study. Immunization with L523 DNA prior to a second round of immu nization with 
L523-adenovirus did not significantly increase the proliferation responses generated in 
the mice. As with the first study, responses were stronger in the B6D2(F1) mouse strain 
than in the C57BL6 strain. 

The difference in HLA types between the two strains of mice could 
explain variations in the extent of the immune responses detected. As described above, 
the C57BL6 strain is homozygous for H-2 b , while the B6D2(F1) is heterozygous for H- 
2 b/d . The increased diversity of the B6D2(F1) strains HLA type allows for a greater 
number of epitopes derived from the L523 protein to be presented In this strain, 
epitopes specific for both H-2 b and H-2 d can be presented, while only H-2 b epitopes can 
be presented by the C57BL6 strain. 

From the foregoing it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 

What is Claimed : 

1 . A method for inducing an immune response in an animal, 

comprising: 

a) providing a composition comprising a polynucleotide encoding at 
least an immunogenic portion of a lung carcinoma polynucleotide wherein the 
polynucleotide has at least 90%identity with SEQ ID NO:347; 

b) administering said polynucleotide; and 

c) thereby inducing an immune response in an animal. 



2. The method of claim 1, wherein said composition further 
comprises a component selected from the group consisting of a physiologically 
acceptable carrier or an adjuvant. 



3. A method according to claim 1 , wherein the lung carcinoma 
polynucleotide is delivered by a viral based delivery system. 

4. A method according to claim 3, wherein the viral based delivery 
system is an adenovirus. 

5. The method of claim 1, wherein the immune response induced is 
a CD4+ T helper response. 

6. The method of claim 1, wherein the immune response induced is 
a CD8+ cytotoxic T lymphocyte response. 

7. The method of claim 1 , wherein the immune response induced is 
both a CD4+ T helper and CD8+ cytotoxic T cell immune response. 
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8. An isolated polynucleotide comprising a sequence selected from 
the group consisting of: 

(a) sequences provided in SEQ ID NO:351, 353, 358, 362, 364, 366, 
368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467; 

(b) complements of the sequences provided in SEQ ID NO:351, 353, 
358, 362, 364, 366, 368, 370-375, 420, 424, 428, 43 1, 434, 442, 447, 450 and 467; 

(c) sequences consisting of at least 10 contiguous residues of a 
sequence provided in SEQ ID NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 
424, 428, 431, 434, 442, 447, 450 and 467; 

(d) sequences that hybridize to a sequence provided in SEQ ID 
NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 
and 467, under highly stringent conditions; 

(e) sequences having at least 75% identity to a sequence of SEQ ID 
NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 
and 467; 

(f) sequences having at least 90% identity to a sequence of SEQ ID 
NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 
and 467; and 

(g) degenerate variants of a sequence provided in SEQ ID NO:351, 
353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467. 

9. An isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of: 

(a) sequences having at least 90% identity to a polypeptide having an 
amino acid sequence of any one of the sequences provided in SEQ ID NO:352, 354, 
357, 361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 
449, 451-466 and 468-469; 

(b) sequences encoded by a polynucleotide of claim 8 ; 

(c) sequences having at least 70% identity to a sequence encoded by 
a polynucleotide of claim 8 ; and 
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(d) sequences having at least 90% identity to a sequence encoded by 
a polynucleotide of claim 8. 

10. An expression vector comprising a polynucleotide of claim 8 
operably linked to an expression control sequence. 

11. A host cell transformed or transfected with an expression vector 
according to claim 10. 

12. An isolated antibody, or antigen-binding fragment thereof, that 
specifically binds to a polypeptide of claim 9. 

13. A method for detecting the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with a binding agent that binds 
to a polypeptide of claim 9; 

(c) detecting in the sample an amount of polypeptide that binds to 
the binding agent; and 

(d) comparing the amount of polypeptide to a predetermined cut-off 
value and therefrom determining the presence of a cancer in the patient 

14. A fusion protein comprising at least one polypeptide according to 

claim 9. 

15. A fusion protein according to claim 14, wherein the fusion 
protein is selected from the group consisting sequences provided in SEQ ID NO:352, 
354,423, 427, 430 and 433. 
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16. An oligonucleotide that hybridizes to a sequence recited in SEQ 
ID NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 
450 and 467 under highly stringent conditions. 

17. A method for stimulating and/or expanding T cells specific for a 
tumor protein, comprising contacting T cells with at least one component selected from 
the group consisting of: 



to claim 8, 

under conditions and for a time sufficient to permit the stimulation 
and/or expansion of T cells. 

18. An isolated T cell population, comprising T cells prepared 
according to the method of claim 1 7. 

19. A composition comprising a first component selected from the 
group consisting of physiologically acceptable carriers and immunostimulants, and a 
second component selected from the group consisting of: 



(a) polypeptides according to claim 9; 

(b) polynucleotides according to claim 8; 

(c) antibodies according to claim 12; 

(d) fusion proteins according to claim 14; 

(e) T cell populations according to claim 1 8; and 

(f) antigen presenting cells that express a polypeptide according to 



20. A method for stimulating an immune response in a patient, 
comprising administering to the patient a composition of claim 19. 



(a) 
(b) 
(c) 



polypeptides according to claim 9; 

polynucleotides according to claim 8; and 

antigen-presenting cells that express a polynucleotide according 



claim 9. 
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21. A method for the treatment of a lung cancer in a patient, 
comprising administering to the patient a composition of claim 19. 

22. A method for determining the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide 
according to claim 9; 

(c) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; and 

(d) compare the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence of the cancer in the patient 

23. A diagnostic kit comprising at least one oligonucleotide 
according to claim 16. 

24. A diagnostic kit comprising at least one antibody according to 
claim 12 and a detection reagent, wherein the detection reagent comprises a reporter 
group. 

25. A method for the treatment of lung cancer in a patient, 
comprising the steps of: 

(a) incubating CD4+ and/or CD8+ T cells isolated from a patient 
with at least one component selected from the group consisting of: (i) polypeptides 
according to claim 9; (ii) polynucleotides according to claim 8; and (iii) antigen 
presenting cells that express a polypeptide of claim 9, such that T cell proliferate; 

(b) administering to the patient an effective amount of the 
proliferated T cells, 

and thereby inhibiting the development of a cancer in the patient 
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SEQUENCE LISTING 

<110> Corixa Corporation 
Wang, Tongtong 
Wang, Aijun 
Skeiky, Yasir A.W. 
Li, Samual X. 
Kalos, Michael D. 
Henderson, Robert A. 
McNeill, Patricia D. 
Fanger, Neil 
Retter, Marc W. 
Durham, Margarita 
Fanger, Gary R. 
Vedvick, Thomas S. 
Carter, Darrick 
Watanabe, Yoshihiro 
Peckman, David W. 
Cai, Feng 
Foy, Teresa M. 



<120> COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF LUNG CANCER 



<130> 210121. 45503PC 

<140> PCT 

<141> 2001-11-30 

<160> 469 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 315 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 
<222> 236, 241 
<223> n « A,T,C or G 

<400> 1 

gcagagacag actggtggtt 
cagctgccgt gagactcccg 
ttcatctcca gcagagacaa 
gttaatatgt ttgtaaactc 
naaattacaa atagaatcat 
aaaaaaaaaa aaaaa 



gaacctggag gtgccaaaaa 
atgtcacagg cagtctgtgt 
cggaggaggc tcccaccagg 
atgtacagtt ttttttgggg 
ttgctgtaat ccttaaatgg 



agccagctgc gggcccagga 60 

ggttacagcg cccctcagtg 120 

acggttctca ttatttatat 180 

gggaagcaat gggaanggta 240 

caaacggtca ggccacgtga 300 

^ 315 



<210> 2 
<211> 380 
<212> DNA 
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<213> Homo sapiens 



<400> 2 

atttaggctt 

atatatataa 

cataactttt 

ctgaaatgag 

ggataaattc 

ttattggaaa 

gtaaaaaaaa 



aagattttgt 
acaaatacaa 
aacaacactg 
tggatgtata 
ccagtctaga 
ttttgtcctc 
aaaaaaaaaa 



ttacccttgt 
aaagttttga 
ctctgtaatg 
gtgttattgc 
ttattagcct 
tgtaactggc 



tactaaggag 
gtggttcagc 
ggttgaactg 
ataattatcc 
ttgttaacca 
actttggggt 



caaattagta 
ttttttattt 
tggtactcag 
cactatgaag 
tcaagcacct 
gtgacttatc 



ttaaagtata 60 
tttttaatgg 120 
actgagataa 180 
caaagggact 240 
agaagaagaa 300 
ttttgccttt 360 
380 



<210> 3 
<211> 346 
<212> DNA 

<213> Homo sapiens 

<220> i 
<221> misc_feature 

<222> 316, 317, 318, 322, 323, 326, 329, 330, 331, 336, 337, 339, 

340, 342, 343 

<223> n = A,T,C or G 



<400> 3 

ttgtaagtat 

catcctcacc 

atacaattgt 

gaagtttcct 

gacttccaac 

gcaataattt 



acaattttag 
atacaccatc 
actttctttg 
taatggaatg 
aatttgatca 
ctattnnnag 



aaaggattaa 
cactttccaa 
gattttcata 
agtcattttt 
tatagtgttg 
aim c cng gnn 



atgttattga 
taacatttaa 
acaaatatac 
gtcttgtgct 
agctgtggaa 
naaaannann 



tcattttact 
tcctttctaa 
catagactgt 
tttgaggtta 
atctttaagt 
annaaa 



gaatactgca 60 
aattgtaagt 120 
taattttatt 180 
cctttgcttt 240 
ttattctata 300 
346 



<210> 4 
<211> 372 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 297, 306, 332 
<223> n = A, T, C or G 



<400> 4 

actagtctca ttactccaga attatgctct tgtacctgtg tggctgggtt tcttagtcgt 60 
tggtttggtt tggttttttg aactggtatg tagggtggtt cacagttcta atgtaagcac 120 
tctcttctcc aagttgtgct ttgtggggac aatcattctt tgaacattag agaggaaggc 180 
agttcaagct gttgaaaaga ctattgctta tttttgtttt taaagaccta cttgacgtca 240 
tgtggacagt gcacgtgcct tacgctacat cttgttttct aggaagaagg ggatgcnggg 300 
aaggantggg tgctttgtga tggataaaac gnctaaataa cacaccttta cattttgaaa 360 
aaaacaaaac aa 372 

<210> 5 

<211> 698 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<222> 8, 345, 422, 430, 433, 436, 438, 472, 481, 486, 515, 521, 
536, 549, 553, 556, 557, 559, 568, 593, 597, 605, 611, 613, 
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616, 618, 620, 628, 630, 632, 634, 635, 639, 643, 647, 648, 
649, 652, 654, 658, 664, 690 
<223> n = A, T,C or G 

<400> 5 

actagtanga tagaaacact gtgtcccgag agtaaggaga gaagctacta ttgattagag 60 
cctaacccag gttaactgca agaagaggcg ggatactttc agctttccat gtaactgtat 120 
gcataaagcc aatgtagtcc agtttctaag atcatgttcc aagctaactg aatcccactt 180 
caatacacac tcatgaactc ctgatggaac aataacaggc ccaagcctgt ggtatgatgt 240 
gcacacttgc tagactcaga aaaaatacta ctctcataaa tgggtgggag tattttgggt 300 
gacaacctac tttgcttggc tgagtgaagg aatgatattc atatnttcat ttattccatg 360 
gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 42 0 
tntccaaatn ttngtncngt cgctgcacat atctgaaatc ctatattaag antttcccaa 480 
natgangtcc ctggtttttc cacgccactt gatcngtcaa ngatctcacc tctgtntgtc 540 
ctaaaaccnt ctnctnnang gttagacngg acctctcttc tcccttcccg aanaatnaag 600 
tgtgngaaga nanccncncn cccccctncn tncnncctng ccngctnnnc cncntgtngg 660 
gggngccgcc cccgcggggg gacccccccn ttttcccc 698 

<210> 6 

<211> 740 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 82, 406, 426, 434, 462, 536, 551, 558, 563, 567, 562, 584, 
592, 638, 651, 660, 664, 673, 675, 697, 706, 711, 715, 716, 
717, 723, 724, 725, 733 
<223> n = A,T,C or G 

<400> 6 

actagtcaaa aatgctaaaa taatttggga gaaaatattt tttaagtagt gttatagttt 60 

catgtttatc ttttattatg tnttgtgaag ttgtgtcttt tcactaatta cctatactat 120 

gccaatattt ccttatatct atccataaca tttatactac atttgtaaga gaatatgcac 180 

gtgaaactta acactttata aggtaaaaat gaggtttcca agatttaata atctgatcaa 240 

gttcttgtta tttccaaata gaatggactt ggtctgttaa ggggctaagg gagaagaaga 300 

agataaggtt aaaagttgtt aatgaccaaa cattctaaaa gaaatgcaaa aaaaaattta 360 

ttttcaagcc ttcgaactat ttaaggaaag caaaatcatt tcctanatgc atatcatttg 420 

tgagantttc tcantaatat cctgaatcat tcatttcagc tnaggcttca tgttgactcg 480 

atatgtcatc tagggaaagt ctatttcatg gtccaaacct gttgccatag ttggtnaggc 540 

tttcctttaa ntgtgaanta ttnacangaa attttctctt tnanagttct tnatagggtt 600 

aggggtgtgg gaaaagcttc taacaatctg tagtgttncg tgttatctgt ncagaaccan 660 

aatnacggat cgnangaagg actgggtcta tttacangaa cgaatnatct ngttnnntgt 720 
gtnnncaact ccngggagcc 740 

<210> 7 
<211> 670 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<222> 265, 268, 457, 470, 485, 546, 553, 566, 590, 596, 613, 624, 
639, 653, 659, 661 
<223> n = A,T,C or G 

<400> 7 

gctggggagc tcggcatggc ggtccccgct gcagccatgg ggccctcggc gttgggccag 60 
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agcggccccg gctcgatggc cccgtggtgc 
cttgggatgc aggagctgtt ccggggccac 
ccaaggtgca ctcggtggcc tggagttgcg 
aagacgccac gtcttcttgc tgganaanga 
catggggata gtgtggacca ctttgttggc 
cgtctggaga taaaaccatt cgcatctggg 
tgaacactaa aggggagaac attaatatct 
tagcnacaag gatgatgtgg tgactttatt 
aaacanttcc aanttcgaag tcaccnaaat 
tcctgacaat ggnccttggg tgtntcacat 
natccacccc 



tcagtgagca gcggcccgtc gcgctacgtg 120 
agcaagaccg cgagttcctg gcgcacagcg 180 
acgggcgtcg cctacctcgg ggtcttcgac 240 
ccgttggtca aagaaaacaa ttatcgggga 300 
atccaagtaa tcctgaccta tttgttacgg 360 
atgtgaggac tacaaaatgc attgccactg 420 
gctggantcc tgatgggcan accattgctg 480 
gatgccaaga aaccccgttc caaagcaaaa 540 
ctcctggaac aatgaacatn aatatnttct 600 
cctcagctnc cccaaaactg aancctgtnc 660 

67 0 



<210> 8 

<211> 689 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 253, 335, 410, 428, 448, 458, 466, 479, 480, 482, 483, 485, 
488, 491, 492, 495, 499, 500, 502, 503, 512, 516, 524, 525, 
526, 527, 530, 540, 546, 550, 581, 593, 594, 601, 606, 609, 
610, 620, 621, 622, 628, 641, 646, 656, 673 
<223> n = A,T,C or G 

<400> 8 

actagtatct aggaatgaac agtaaaagag gagcagttgg ctacttgatt acaacagagt 60 
aaatgaagta ctggatttgg gaaaacctgg ttttattaga acatatggaa tgaaagccta 120 
cacctagcat tgcctactta gccccctgaa ttaacagagc ccaattgaga caaacccctg 180 
gcaacaggaa attcaaggga gaaaaagtaa gcaacttggg ctaggatgag ctgactccct 240 
tagagcaaag ganagacagc ccccattacc aaataccatt tttgcctggg gcttgtgcag 300 
ctggcagtgt tcctgcccca gcatggcacc ttatngtttt gatagcaact tcgttgaatt 360 
ttcaccaact tattacttga aattataata tagcctgtcc gtttgctgtn tccaggctgt 420 
gatatatntt cctagtggtt tgactttnaa aataaatnag gtttantttt ctccccccnn 480 
cnntnctncc nntcnctcnn cnntcccccc cnctcngtcc tccnnnnttn gggggggccn 54 0 
cccccncggn ggacccccct ttggtccctt agtggaggtt natggcccct ggnnttatcc 600 
nggccntann tttccccgtn nnaaatgntt ccccctccca ntcccnccac ctcaanccgg 660 
aagcctaagt ttntaccctg ggggtcccc 689 

<210> 9 

<211> 674 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 602, 632, 639, 668 
<223> n = A,T,C or G 



<400> 9 

gtccactctc ctttgagtgt actgtcttac 
taaaaaatgc ttgttctata gtggagtaag 
gaaaaaagcg aggctttttt gccaccttgg 
ataagcctga agggaagtag ctatgagact 
ccttcatgga aaaaggcttc ctgtaataat 
ttctgaaata agagacaaat tgggccgcag 
aaagttttgt ttggtcttca ccaaaggaca 
caaaaacatt agctgttctg tctttcaatt 



tgtgcactct gtttttcaac tttctagata 60 
agctcacaca cccaaggcag caagataact 120 
taaaggccag ttcactgcta tagaactgct 180 
ttccattttt cttagttctc ccaataggct 240 
tttcacctaa tgaattagca gtgtgattat 300 
agtcttcctg tgatttaaaa taaacaaccc 360 
tactctaggg ggtatgttgt tgaagacatt 420 
tcaagttatt ttggagactg cctccatgtg 480 
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agttaattac tttgctctgg aactagcatt attgtcatta. tcatcacatt ctgtcatcat 540 

catctgaata atattgtgga tttccccctc tgcttgcatc ttcttttgac tcctctggga 600 

anaaatgtca aaaaaaaagg tcgatctact cngcaaggnc catctaatca ctgcgctgga 660 

aggacccnct gccc 674 

<210> 10 

<211> 346 s 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 320, 321, 322, 325, 326, 328, 329, 330, 332, 333, 334, 335, 
342 

<223> n = A,T,C or G 



<400> 10 

actagtctgc tgatagaaag cactatacat 
ttctgtctgt aacaaaaatg tactttatag 
ccttaagtgt ttctgtcatt gttcaagtgt 
tttttctttt ccccttataa attgtaattc 
tgtcagatta tattatctaa caattgaata 
aaagggtact tttctattan nnagnngnnn 

<210> 11 

<211> 602 

<212> DNA 

<213> Homo sapiens 



cctattgttt ctttctttcc aaaatcagcc 60 
agatggagga aaaggtctaa tactacatag 120 
attttctgta acagaaacat atttggaatg 180 
ctgaaatact gctgctttaa aaagtcccac 240 
ttgtaaatat acttgtctta cctctcaata 300 
gnnnnataaa anaaaa 346 



<400> 11 

actagtaaaa agcagcattg ccaaataatc cctaattttc cactaaaaat ataatgaaat 60 
gatgttaagc tttttgaaaa gtttaggtta aacctactgt tgttagatta atgtatttgt 120 
tgcttccctt tatctggaat gtggcattag cttttttatt ttaaccctct ttaattctta 180 
ttcaattcca tgacttaagg ttggagagct aaacactggg atttttggat aacagactga 240 
cagttttgca taattataat cggcattgta catagaaagg atatggctac cttttgttaa 300 
atctgcactt tctaaatatc aaaaaaggga aatgaagtta taaatcaatt tttgtataat 360 
ctgtttgaaa catgagtttt atttgcttaa tattagggct ttgccccttt tctgtaagtc 420 
tcttgggatc ctgtgtagaa ctgttctcat taaacaccaa acagttaagt ccattctctg 480 
gtactagcta caaattcggt ttcatattct acttaacaat ttaaataaac tgaaatattt 540 
ctagatggtc tacttctgtt catataaaaa caaaacttga tttccaaaaa aaaaaaaaaa 600 
aa 602 

<210> 12 

<211> 685 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 170, 279, 318, 321, 322, 422, 450, 453, 459, 467, 468, 470, 
473, 475, 482, 485, 486, 491, 498, 503, 506, 509, 522, 526, 
527, 528, 538, 542, 544, 551, 567, 568, 569, 574, 576, 582, 
587, 588, 589, 590, 592, 593, 598, 599, 603, 605, 608 
<223> n = A,T,C or G 

<221> misc_£eature 

<222> 633, 634, 635, 644, 646, 648, 651, 655, 660, 662, 663, 672, 
674, 675, 682, 683 
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<223> n = A,T,C or G 



<400> 12 

actagtcctg tgaaagtaca actgaaggca gaaagtgtta ggattttgca tctaatgttc 60 
at tat cat gg tattgatgga cctaagaaaa taaaaattag actaagcccc caaataagct 120 
gcatgcattt gtaacatgat tagtagattt gaatatatag atgtagtatn ttgggtatct 180 
aggtgtttta tcattatgta aaggaattaa agtaaaggac tttgtagttg tttttattaa 240 
atatgcatat agtagagtgc aaaaatatag caaaaatana aactaaaggt agaaaagcat 300 
tttagatatg ccttaatnta nnaactgtgc caggtggccc tcggaataga tgccaggcag 360 
agaccagtgc ctgggtggtg cctccccttg tctgcccccc tgaagaactt ccctcacgtg 420 
angtagtgcc ctcgtaggtg tcacgtggan tantggganc aggccgnncn gtnanaagaa 480 
ancanngtga nagtttcncc gtngangcng aactgtccct gngccnnnac gctcccanaa 54 0 
cntntccaat ngacaatcga gtttccnnnc tccngnaacc tngccgnnnn cnngcccnnc 600 
cantntgnta accccgcgcc cggatcgctc tcnnntcgtt ctcncncnaa ngggntttcn 660 
cnnccgccgt cncnnccccg cnncc 685 

<210> 13 

<211> 694 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 503, 546, 599, 611, 636, 641, 643, 645, 656, 658, 662, 676, 
679, 687 

<223> n = A, T,C or G 



<400> 13 

cactagtcac tcattagcgt tttcaatagg gctcttaagt ccagtagatt acgggtagtc 60 

agttgacgaa gatctggttt acaagaacta attaaatgtt tcattgcatt tttgtaagaa 120 

cagaataatt ttataaaatg tttgtagttt ataattgccg aaaataattt aaagacactt 180 

tttctctgtg tgtgcaaatg tgtgtttgtg atccattttt tttttttttt taggacacct 240 

gtttactagc tagctttaca atatgccaaa aaaggatttc tccctgaccc catccgtggt 300 

tcaccctctt ttccccccat gctttttgcc ctagtttata acaaaggaat gatgatgatt 360 

taaaaagtag ttctgtatct tcagtatctt ggtcttccag aaccctctgg ttgggaaggg 420 

gatcattttt tactggtcat ttccctttgg agtgtactac tttaacagat ggaaagaact 480 

cattggccat ggaaacagcc gangtgttgg gagccagcag tgcatggcac cgtccggcat 540 

ctggcntgat tggtctggct gccgtcattg tcagcacagt gccatgggac atggggaana 600 

ctgactgcac ngccaatggt tttcatgaag aatacngcat ncncngtgat cacgtnancc 660 

angacgctat gggggncana gggccanttg cttc 694 

<210> 14 

<211> 679 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 29, 68, 83, 87, 94, 104, 117, 142, 145, 151, 187, 201, 211, 
226, 229, 239, 241, 245, 252, 255, 259, 303, 309, 359, 387, 
400, 441, 446, 461, 492, 504, 505, 512, 525, 527, 533, 574, 
592, 609, 610, 618, 620, 626, 627, 633, 639, 645, 654 
<223> n « A,T,C or G 



<400> 14 

cagccgcctg catctgtatc cagcgccang 
agtcccgnac ccgttcggcc cangctnagt 
ccaagtgcat caaatacctg cngtncggat 



tcccgccagt cccagctgcg cgcgcccccc 60 
tagncctcac catnccggtc aaaggangca 120 
ntaaattcat cttctggctt gccgggattg 180 
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ctgtccntgc cattggacta nggctccgat ncgactctca gaccanganc atcttcganc 240 
naganactaa tnatnattnt tccagcttct acacaggagt ctatattctg atcggatccg 300 
gcnccctcnt gatgctggtg ggcttcctga gctgctgcgg ggctgtgcaa gagtcccant 360 
gcatgctggg actgttcttc ggcttcntct tggtgatatn cgccattgaa atacctgcgg 420 
ccatctgggg atattccact ncgatnatgt gattaaggaa ntccacggag ttttacaagg 480 
acacgtacaa cnacctgaaa accnnggatg anccccaccg ggaancnctg aangccatcc 540 
actatgcgtt gaactgcaat ggtttggctg gggnccttga acaatttaat cncatacatc 600 
tggccccann aaaggacntn ctcganncct tcnccgtgna attcngttct gatnccatca 660 
cagaagtctc gaacaatcc 679 

<210> 15 

<211> 695 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 105, 172, 176, 179, 189, 203, 212, 219, 221, 229, 231, 238, 
242, 261, 266, 270, 278, 285, 286, 298, 311, 324, 337, 350, 
363, 384, 391, 395, 405, 411, 424, 427, 443, 448, 453, 455, 
458, 463, 467, 470, 479, 482, 484, 493, 499, 505, 518 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 520, 523, 531, 540, 584, 595, 597, 609, 611, 626, 628, 651, 
652, 657, 661, 665, 669, 672, 681, 683, 691, 693 
<223> n - A,T,C or G 



<400> 15 

actagtggat aaaggccagg gatgctgctc aacctcctac catgtacagg gacgtctccc 60 
cattacaact acccaatccg aagtgtcaac tgtgtcagga ctaanaaacc ctggttttga 120 
ttaaaaaagg gcctgaaaaa aggggagcca caaatctgtc tgcttcctca cnttantcnt 180 
tggcaaatna gcattctgtc tcnttggctg cngcctcanc ncaaaaaanc ngaactcnat 240 
cnggcccagg aatacatctc ncaatnaacn aaattganca aggcnntggg aaatgccnga 300 
tgggattatc ntccgcttgt tgancttcta agtttcnttc ccttcattcn accctgccag 360 
ccnagttctg ttagaaaaat gccngaattc naacnccggt tttcntactc ngaatttaga 420 
tctncanaaa cttcctggcc acnattcnaa ttnanggnca cgnacanatn ccttccatna 480 
ancncacccc acntttgana gccangacaa tgactgcntn aantgaaggc ntgaaggaan 540 
aactttgaaa ggaaaaaaaa ctttgtttcc ggccccttcc aacncttctg tgttnancac 600 
tgccttctng naaccctgga agcccngnga cagtgttaca tgttgttcta nnaaacngac 660 
ncttnaatnt cnatcttccc nanaacgatt ncncc 695 

<210> 16 

<211> 669 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 299, 354, 483, 555, 571, 573, 577, 642, 651, 662, 667 
<223> n = A,T,C or G 



<400> 16 

cgccgaagca gcagcgcagg ttgtccccgt 
ttcccgggcc ccttacactc cacagtcccg 
agaaccctgc ggaggagacc ggcgaggaga 
tgcctgagag agctgaagag gcaaagctaa 
ctggaggctc cgacttcctc atgaagagac 



ttcccctccc ccttcccttc tccggttgcc 60 
gtcccgccat gtcccagaaa caagaagaag 120 
agcaggacac gcaggagaaa gaaggtattc 180 
aggccaaata cccaagccta ggacaaaagc 240 
tccagaaagg gcaaaagtac tttgactcng 300 
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gagactacaa 
acaagaacct 
ctcgctcgtc 
canatctgag 
tgcttttgca 
cctgttggtg 
tntcttncc 

<210> 17 

<211> 697 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 33, 48, 50, 55, 59, 60, 76, 77, 78, 90, 113, 118, 130, 135, 
141, 143, 150, 156, 166, 167, 170, 172, 180, 181, 190, 192, 
194, 199, 201, 209, 212, 224, 225, 226, 230, 233, 234, 236, 
242, 244, 251, 253, 256, 268, 297, 305, 308, 311, 314 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 315, 317, 322, 324, 327, 333, 337, 343, 362, 364, 367, 368, 
373, 384, 388, 394, 406, 411, 413, 423, 429, 438, 449, 450, 
473, 476, 479, 489, 491, 494, 499, 505, 507, 508, 522, 523, 
527, 530, 533, 535, 538, 539, 545, 548, 550, 552, 555 
<223> n = A,T,C or G 

<221> miscjfeature 

<222> 562, 563, 566, 568, 572, 577, 578, 580, 581, 591, 594, 622, 
628, 632, 638, 642, 644, 653, 658, 662, 663, 665, 669, 675, 
680, 686, 689 
<223> n = A, T, C or G 



ca tggccaaa gccaacatga agaataagca gctgccaagt gcangaccag 360 
ggtgactggt gatcacatcc ccaccccaca ggatctgccc agagaaagtc 420 
accagcaagc ttgcgggtgg ccaagttgaa tgatgctgcc ggggctctgc 480 
acgcttccct ccctgcccca cccgggtcct gtgctggctc ctgcccttcc 540 
gccangggtc aggaagtggc ncnggtngtg gctggaaagc aaaacccttt 600 
tcccacccat ggagcccctg gggcgagccc angaacttga ncctttttgt 660 

669 



<400> 17 

gcaagatatg gacaactaag 
gacgcgctga ggagannnac 
gcctgcccan gggancccca 
ncctggctcn cncngcccng 
cncnccctcc ncnacnacct 
ccacnacncc ntcnncncga 
cncnacnncg cgntcccccg 
agncacgcnc tccgcccnct 
ccccgctcnc nccnctgcnc 
ccccngcngn angcngtgcg 
cgcccgctgg gggctcccgc 
cnncnctcnc gctcngcgcn 



tgagaaggta atnctctact 
gctggcccan ctgccggcca 
ncnctcggan cccatntcac 
nccagctcnc gnccccctcc 
cctacccncg gctccctccc 
ancnccnctc gcnctcngcc 
cgcncgcngc ctcnccccct 
gacgccccnn cccgccgcgc 
gccgncnngg cgccccgccc 
cnncangncc gngccgnncn 
cncgcggntc antccccncc 
cgcccnccnc ccccccc 



gctctagntn ctccnggcnn 60 
cacacgggga tcntggtnat 120 
acccgnnccn tncgcccacn 180 
gccnnnctcn ttnncntctc 240 
cagccccccc ccgcaancct 300 
ccngccccct gccccccgcc 360 
cccacnacag ncncacccgc 420 
tcaccttcat ggnccnacng 480 
cnnccgngtn ccncncgnng 540 
ncaccctccg nccnccgccc 600 
cntncgccca ctntccgntc 660 

697 



<210> 18 

<211> 670 " 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 234, 292, 329, 437, 458, 478, 487, 524, 542, 549, 550, 557, 
576, 597, 603, 604, 646, 665 
<223> n = A, T, C or G 
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<400> 18 
ctcgtgtgaa 
ctgacctcca 
gggacggctg 
cggcgccgtg 
catcttcttc 
cttcaggatc 
aatctcctcc 
tctcgaccaa 
, gaacgantgt 
gncctcacnn 
acnnaaaagg 
tttanccacc 



gggtgcagta 
gtgccgccgg 
cccgccgggc 
gcctacggtg 
aatcggatcg 
cttggttcca 
ctacaggctc 
tgctcangaa 
tgccgtccat 
ctgatcnccc 
gccaaggact 



cctaagccgg 
cctcaagatc 
cccggggcat 
tgcgcgaatc 
gtggagtgca 
gtaccccanc 
caaagaccta 
cttcctaaca 
tgtcacgaag 
agcggggcca 
tcccctcatc 



agcggggtag 
agacatggcc 
gggcacggcc 
tgtgttcacc 
caggacacta 
atctatgaca 
cagatggtga 
tgttccancg 
tgctcaagaa 
agttanccct 
ctggataatg 



<210> 19 

<211> 606 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 506 

<223> n = A, T, C or G 



aggcgggccg 
cagaacttga 
ctgaagctgt 
gtggaaggcg 
tcctgggccg 
ttcgggccag 
atatctccct 
cctaagggct 
tttnggtggc 
ggttgatccc 
tggccntcac 



gcaccccctt 60 
acgacttggc 120 
tgctgggggc 180 
ggcncagagc 240 
anggccttca 300 
acctcgaaaa 360 
gcgagtgttg 420 
ggactacnaa 480 
caagttcaat 54 0 
cgggganctg 600 
aaagctcaac 660 
670 



<400> 19 

actagtgcca 

tggcctcagt 

tgtcgccttg 

ccaggctgtg 

tcacatgcgt 

tactatgtgt 

gggcactagc 

gagctgctgg 

cagccaaaag 

tcttctgtct 

gagacc 



acctcagctc 
tgtccttggt 
gctcaactgt 
ccctggaaag 
cctacctgtg 
ctgtccactg 
ctgactttta 
tttagccttg 
ctgaatggaa 
gttttgtttt 



ccaggccagt 
tattgatggg 
ggttgatttg 
tactacagcc 
aaactctggg 
acgactgtca 
aggcagtgtg 
cacctgggga 
aagttnagaa 
tcaattgaaa 



tctctgaatg 
ggacaaattg 
tctgtgcccg 
atcctccaac 
aagcaggaag 
aggcctcatt 
tctttctgag 
aaggatgtat 
cattcctagg 
agttattaaa 



tcgaggagtt 
gggatggcca 
gaaagtttgg 
agaagtacgg 
gcccaagacc 
tgcagaggcc 
cactgtagac 
ttatttgtat 
tggccttatt 
taacagattt 



ccaggatctc 60 
gagccccgag 120 
catcattcgt 180 
actgctcccc 240 
tggtgctgga 300 
accggagcta 360 
caagcccttg 420 
tttcatatat 480 
ctaataagtt 540 
agaatctagt 600 
606 



<210> 20 

<211> 449 

<212> DNA 

<213> Homo sapiens 



<400> 20 

actagtaaac 

cagcgccaga 

ccaccacagc 

tgccagaaca 

cttcaagaat 

tgaagtcaca 

atttctttag 

aaaacaaaat 



aacagcagca 
gccgaggaga 
cgcctgccag 
tcaaggagtt 
acaacaacta 
ccagggcaac 
tgtcattgcc 
cttgactgct 



gaaacatcag 
acccccgctc 
gatggactcg 
cactgcccaa 
agaaaaggaa 
tcttggaaga 
gattttggct 
tgctcaaaa 



tatcagcagc 
cctgaggagg 
ctgctcattg 
aacttaggca 
gtttccagaa 
aatatatttg 
ataacagtgt 



gtcgccagca 
acctgtccaa 
caggccagat 
agctcttcat 
aagaagttaa 
catattgaaa 
ctttctagcc 



ggagaatatg 60 
actcttcaaa 120 
aaacacttac 180 
ggcccaggct 240 
catgaactct 300 
agcacagagg 360 
ataataaaat 420 
449 



<210> 21 

<211> 409 

<212> DNA 

<213> Homo sapiens 
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<400> 21 

tatcaatcaa ctggtgaata attaaacaat gtgtggtgtg atcatacaaa gggtaccact 60 

caatgataaa aggaacaagc tgcctatatg tggaacaaca tggatgcatt tcagaaactt 120 

tatgttgagt gaaagaacaa acacggagaa catactatgt ggttctcttt atgtaacatt 180 

acagaaataa aaacagaggc aaccaccttt gaggcagtat ggagtgagat agactggaaa 240 

aaggaaggaa ggaaactcta cgctgatgga aatgtctgtg tcttcattgg gtggtagtta 300 

tgtggggata tacatttgtc aaaatttatt gaactatata ctaaagaact ctgcatttta 360 

ttgggatgta aataatacct caattaaaaa gacaaaaaaa aaaaaaaaa 409 

<210> 22 

<211> 649 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 263, 353, 610, 635, 646 

<223> n - A,T/C or G 

<400> 22 

acaattttca ttatcttaag cacattgtac atttctacag aacctgtgat tattctcgca 60 
tgataaggat ggtacttgca tatggtgaat tactactgtt gacagtttcc gcagaaatcc 120 
tatttcagtg gaccaacatt gtggcatggc agcaaatgcc aacattttgt ggaatagcag 180 
caaatctaca agagaccctg gttggttttt cgttttgttt tctttgtttt ttcccccttc 240 
tcctgaatca gcagggatgg aangagggta gggaagttat gaattactcc ttccagtagt 300 
agctctgaag tgtcacattt aatatcagtt ttttttaaac atgattctag ttnaatgtag 360 
aagagagaag aaagaggaag tgttcacttt tttaatacac tgatttagaa atttgatgtc 420 
ttatatcagt agttctgagg tattgatagc ttgctttatt tctgccttta cgttgacagt 480 
gttgaagcag ggtgaataac taggggcata tatatttttt ttttttgtaa gctgtttcat 54 0 
gatgttttct ttggaatttc cggataagtt caggaaaaca tctgcatgtt gttatctagt 600 
ctgaagttcn tatccatctc attacaacaa aaacncccag aacggnttg 64 9 

<210> 23 
<211> 669 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 642, 661 
<223> n = A,T,C or G 

<400> 23 

actagtgccg tactggctga aatccctgca 
tactctcagt caccagctct ggaattagat 
tatcctctga cagcctttgg gctgcctcgg 
tcacctgtcg tgcccccctc tgtcaagact 
cgcaaggtgg tgctgatgca gtgcaacatt 
ctgacacttc tgctgaagtt ggaggacaaa 
ccaaatgaga atatccccga gttggcggct 
gctgaccaga gccggttgac ttctctgcta 
ggaacagtac cctcaactca gccgctgtca 
gccctgatct gcgctgtggc tgtcctggac 
agtattacct gtgaagccct tccctccttt 
nttctaacc 



ggaccaggaa gagaaccagt tcagactttg 60 
aaattccttg aagatgtcag gaatgggatc 120 
ccccagcagc cacagcagga ggaggtgaca 180 
ccgacacctg aaccagctga ggtggagact 240 
gagtcggtgg aggagggagt caaacaccac 300 
ctgaaccggc acctgagctg tgacctgatg 360 
gagctggtgc agctgggctt cattagtgag 420 
gaagagactt gaacaagttc aattttgcca 480 
ccgtctcctc ttagagctca ctcgggccag 540 
gtgctgcacc ctctgtcctt ccccccagtc 600 
attattcagg anggctgggg gggctccttg 660 

669 



<210> 24 
<211> 442 
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<212> DNA 

<213> Homo sapiens 



<400> 24 

actagtacca 

tcactgccat 

gatgactatc 

cttacgatgc 

ggtaatgcac 

cggaaagaga 

gatgatctct 

gacctaaaaa 



tcttgacaga 
cattaagcat 
attattctag 
acttttctcc 
ttgctagtac 
aaagccttcc 
gacgatacct 
aaaaaaaaga 



ggatacatgc 
cagtttcaaa 
tcctttgaat 
agcacatcag 
tacacacttt 
tttgttggcc 
gtatgttctt 



tcccaaaacg 
attatagcca 
ttgtaagggg 
atttcaaatt 
ggtacaacaa 
cttaaactga 
attgtgtaaa 



tttgttacca 
ttcatgattt 
aaaaaaaaca 
gaaaattaaa 
aaaacagagg 
gtcaagatct 
taaaattgct 



cacttaaaaa 60 
actttttcca 120 
aaaacaaaaa 180 
gacatgctat 240 
caagaaacaa 300 
gaaatgtaga 360 
ggtatgaaat 420 
442 



<210> 25 

<211> 656 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 330, 342, 418, 548, 579, 608 
<223> n = A,T,C or G 



<400> 25 

tgcaagtacc 

ccccggaatg 

accctaatgg 

aggcctgagg 

gacaggatgt 

ctcctagcag 

gggctgatct 

atgggacagt 

attaaaaatc 

tgacatantt 

ctcctganac 



acacactgtt 
tacagtgtct 
ggcagagagt 
tagaggggag 
tagataaagg 
ctggtaaagg 
gattacttcc 
tttccatatc 
actgccctaa 
cttggcatgg 
tcatctacat 



tgaattttgc 
tggtgcacca 
atagccctag 
tggtatgtgt 
ctctagttag 
ggtgctggan 
tggcatcccg 
cttgctgtgg 
ctacacttcc 
ggagccagcc 
agaattggtt 



acaaaaagtg 
agatgccttc 
cccagtggtg 
tttctcagtg 
ggtgtcattg 
gccatggagg 
ctcactttta 
agctctggaa 
tccttgaagg 
acaaatgana 
aaaccctccc 



actgtaggat 
taaaggctga 
acatgaccac 
gaagcagcac 
tcatttgaga 
anctctagaa 
tgggaagtct 
cactctctaa 
aatagaaatg 
atctgaacgt 
ttggaataag 



caggtgatag 60 
cataccttgg 120 
tccctttggg 180 
atgagtgggt 240 
gactgacaca 300 
acattagcat 360 
tattagangg 420 
atttccctct 480 
gaactttctc 540 
gtccaggttt 600 
gaaaaa 656 



<210> 26 

<211> 434 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 395 

<223> n = A,T,C or G 



<400> 2 6 

actagttcag 

ctaggtgttt 

acaaaaaaac 

caccagggtt 

aataactgaa 

gaataagtta 

gtcatttgta 

aaaaaaaaaa 



actgccacgc 
ccatctatgt 
gctgccaggt 
cttttgaaat 
ttgtcaggct 
taatcagtat 
ctgtttgaaa 



caaccccaga 
ttcaatctgt 
tttagaagca 
agtaccacat 
ttgattgata 
tcatctcttt 
aatatttctt 



aaatacccca 
ccatctacca 
gttctggtct 
gtaaaaggga 
attgtagaaa 
gttttttgtc 
ctatnaaatt 



catgccagaa 
ggcctcgcga 
caaaaccatc 
atttggcttt 
taagtagcct 
actcttttct 
aaactaacct 



aagtgaagtc 60 
taaaaacaaa 120 
aggatcctgc 180 
cacttcatct 240 
tctgttgtgg 300 
ctctaattgt 360 
gccttaaaaa 420 
434 



<210> 27 
<211> 654 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 505, 533, 563, 592, 613, 635, 638 
<223> n - A,T,C or G 

<400> 27 

actagtccaa cacagtcaga aacattgttt tgaatcctct gtaaaccaag gcattaatct 60 

taataaacca ggatccattt aggtaccact tgatataaaa aggatatcca taatgaatat 120 

tttatactgc atcctttaca ttagccacta aatacgttat tgcttgatga agacctttca 180 

cagaatccta tggattgcag catttcactt ggctacttca tacccatgcc ttaaagaggg 24 0 

gcagtttctc aaaagcagaa acatgccgcc agttctcaag ttttcctcct aactccattt 300 

gaatgtaagg gcagctggcc cccaatgtgg ggaggtccga acattttctg aattcccatt 360 

ttcttgttcg cggctaaatg acagtttctg tcattactta gattccgatc tttcccaaag 420 

gtgttgattt acaaagaggc cagctaatag cagaaatcat gaccctgaaa gagagatgaa 480 

attcaagctg tgagccaggc agganctcag tatggcaaag gtcttgagaa tcngccattt 54 0 

ggtacaaaaa aaattttaaa gcntttatgt tataccatgg aaccatagaa anggcaaggg 600 

aattgttaag aanaatttta agtgtccaga cccanaanga aaaaaaaaaa aaaa 654 

<210> 28 
<211> 670 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<222> 101, 226, 274, 330, 385, 392, 397, 402, 452, 473, 476, 532, 
534, 538, 550, 583, 595, 604, 613, 622, 643, 669 
<223> n = A,T,C or G 

<400> 28 

cgtgtgcaca tactgggagg atttccacag ctgcacggtc acagccctta cggattgcca 60 

ggaaggggcg aaagatatgt gggataaact gagaaaagaa nccaaaaacc tcaacatcca 120 

aggcagctta ttcgaactct gcggcagcgg caacggggcg gcggggtccc tgctcccggc 180 

gttcccggtg ctcctggtgt ctctctcggc agctttagcg acctgncttt ccttctgagc 240 

gtggggccag ctccccccgc ggcgcccacc cacnctcact ccatgctccc ggaaatcgag 300 

aggaagatca ttagttcttt ggggacgttn gtgattctct gtgatgctga aaaacactca 360 

tatagggaat gtgggaaatc ctganctctt tnttatntcg tntgatttct tgtgttttat 420 

ttgccaaaat gttaccaatc agtgaccaac cnagcacagc caaaaatcgg acntcngctt 480 

tagtccgtct tcacacacag aataagaaaa cggcaaaccc accccacttt tnantttnat 540 

tattactaan ttttttctgt tgggcaaaag aatctcagga acngccctgg ggccnccgta 600 

ctanagttaa ccnagctagt tncatgaaaa atgatgggct ccncctcaat gggaaagcca 660 

agaaaaagnc 670 

<210> 29 
<211> 551 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> 336, 474, 504, 511, 522, 523, 524, 540, 547 
<223> n ~ A, T, C or G 

<400> 29 

actagtcctc cacagcctgt gaatccccct agacctttca agcatagtga gcggagaaga 60 
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agatctcagc gtttagccac cttacccatg 
ccctctccag ccactgatgg gaaagtattc 
tcagtaccag aggtgcctga tgttgcacat 
cctcttgact taagtcgtgg ttcagaagtt 
cgtaatgaat gtcccagggc agaaaaagag 
aaagcaatag ctgatgggaa gaggagctcc 
aaaagtgaaa ttgggaagac aaaagctcaa 
aggaaggaag agagaagaga gacnaagatc 
aaaaaanaaa a 

<210> 30 
<211> 684 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<222> 545, 570, 606, 657, 684 

<223> n » A,T,C or G 



cctgatgatt ctgtagaaaa ggtttcttct 120 
tccatcagtt ctcaaaatca gcaagaatct 180 
ttgccacttg agaagctggg accctgtctc 240 
acagcaccgg tagcctcaga ttcctcttac 300 
gatacncaga tgcttccaaa tccttcttcc 360 
agcagcagca ggaatatcga aaacagaaaa 420 
cagcatttgg taaggagaaa aganaagatg 480 
nctacggacc gnnncggaag aagaagaagn 540 

551 



<400> 30 

actagttcta tctggaaaaa gcccgggttg gaagaagctg tggagagtgc gtgtgcaatg 60 
cgagactcat ttcttggaag catccctggc aaaaatgcag ctgagtacaa ggttatcact 120 
gtgatagaac ctggactgct ttttgagata atagagatgc tgcagtctga agagacttcc 180 
agcacctctc agttgaatga attaatgatg gcttctgagt caactttact ggctcaggaa 240 
ccacgagaga tgactgcaga tgtaatcgag cttaaaggga aattcctcat caacttagaa 300 
ggtggtgata ttcgtgaaga gtcttcctat aaagtaattg tcatgccgac tacgaaagaa 360 
aaatgccccc gttgttggaa gtatacagcg ggagtcttca gatacactgt gtcctcgatg 420 
tgcagaagtt gtcagtggga aaatagtatt aacagctcac tcgagcaaga accctcctga 4 80 
cagtactggg ctagaagttt ggatggatta tttacaatat aggaaagaaa gccaagaatt 540 
aggtnatgag tggatgagta aatggtggan gatggggaat tcaaatcaga attatggaag 600 
aagttnttcc tgttactata gaaaggaatt atgtttattt acatgcagaa aatatanatg 660 
tgtggtgtgt accgtggatg gaan 684 

<210> 31 
<211> 654 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 326, 582, 651 
<223> n = A,T,C or G 



<400> 31 

gcgcagaaaa ggaaccaata tttcagaaac 
aacatcttct cagaatgacc cagaagttat 
tttggcagct gtgctttcca gagatggaag 
agagcctgac agaatagttg gagaattcct 
ccttggtctt ggagatacag tggaaggtct 
tcatgatcag ggaaagcaaa tcagangttc 
aagtgcagag tggaagagct ttccatcacg 
ctatggcaga gcccaatgca aagtttattg 
atgatgttgt gatgggagtt cagtacaagg 
catgctccac tgactgttgt tgcagatggg 
tcaataaagt ttctgtatca ctcatttggt 

<210> 32 



aagcttaata ggaacagctg cctgtacatc 60 
catcgtggga gctggcgtgc ttggctctgc 120 
aaaggtgaca gtcattgaga gagacttaaa 180 
gcagccgggt ggttatcatg ttctcaaaga 240 
tgatgcccag gttgtaaatg gttacatgat 300 
agattcctta ccctctgtca gaaaacaatc 360 
gaagattcat catgagtctc cggaaagcag 420 
aaggtgttgt gttacagtta ttagaggaag 480 
ataaagagac tgggagatat caaggaactc 540 
cttttctcca anttcaggaa aagcctggtc 600 
tggcttctta tgaagaatgc nccc 654 
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<211> 673 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 3*76, 545, 627 
<223> n = A,T,C or G 

<400> 32 

actagtgaag aaaaagaaat 
tatcacctga caccaggagt 
ttaaagacca cacaaggaag 
aatgaattga aatcaaaaga 
gataaactcc tctatccagc 
aataaattaa tcaaatacat 
cccgtgactg tctatnagcc 
tgtgggaaat aactgaaaaa 
atacctagga tttctactgg 
aagangtccc aaggtcacca 
gaaattaaaa gacgcttcag 
cagggattag aaa 

<210> 33 

<211> 673 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 325, 419, 452, 532, 538, 542, 571, 600, 616, 651, 653, 672 
<223> n « A,T,C or G 

<400> 33 

actagttatt tactttcctc cgcttcagaa ggtttttcag actgagagcc taagcatact 60 
ggatctgttg tttcttttgg gtctcacctc atcagtgtgc atagtggcag aaattataaa 120 
gaaggttgaa aggagcaggg aaaagatcca gaagcatgtt agttcgacat catcatcttt 180 
tcttgaagta tgatgcatat tgcattattt tatttgcaaa ctaggaattg cagtctgagg 240 
atcatttaga agggcaagtt caagaggata tgaagatttg agaacttttt aactattcat 300 
tgactaaaaa tgaacattaa tgttnaagac ttaagacttt aacctgctgg cagtcccaaa 360 
tgaaattatg caactttgat atcatattcc ttgatttaaa ttgggctttt gtgattgant 420 
gaaactttat aaagcatatg gtcagttatt tnattaaaaa ggcaaaacct gaaccacctt 480 
ctgcacttaa agaagtctaa cagtacaaat acctatctat cttagatgga tntatttntt 540 
tntattttta aatattgtac tatttatggt nggtggggct ttcttactaa tacacaaatn 600 
aatttatcat ttcaanggca ttctatttgg gtttagaagt tgattccaag nantgcatat 660 
ttcgctactg tnt 673 

<210> 34 

<211> 684 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 414, 472, 480, 490, 503, 507, 508, 513, 523, 574, 575, 598, 

659, 662, 675 

<223> n - A,T,C or G 



tctgatacgg gacaaaaatg ctcttcaaaa catcattctt 60 
tttcattgga aaaggatttg aacctggtgt tactaacatt 120 
caaaatcttt ctgaaagaag taaatgatac acttctggtg 180 
atctgacatc atgacaacaa atggtgtaat tcatgttgta 240 
agacacacct gttggaaatg atcaactgct ggaaatactt 300 
ccaaattaag tttgttcgtg gtagcacctt caaagaaatc 360 
aattattaaa aaatacacca aaatcattga tgggagtgcc 420 
gagaccgaga agaacgaatc attacaggtc ctgaaataaa 480 
aggtggagaa acagaagaac tctgaagaaa ttgttacaag 540 
aattcattga aggtggtgat ggtctttatt tgaagatgaa 600 
ggagacnccc catgaaggaa ttgccagcca caaaaaaatt 660 

673 
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<400> 34 

actagtttat tcaagaaaag aacttactga ttcctctgtt cctaaagcaa gagtggcagg 60 

tgatcagggc tggtgtagca tccggttcct ttagtgcagc taactgcatt tgtcactgat 120 

gaccaaggag gaaatcacta agacatttga gaagcagtgg tatgaacgtt cttggacaag 180 

ccacagttct gagccttaac cctgtagttt gcacacaaga acgagctcca cctccccttc 240 

ttcaggagga atctgtgcgg atagattggc tggacttttc aatggttctg ggttgcaagt 300 

gggcactgtt atggctgggt atggagcgga cagccccagg aatcagagcc tcagcccggc 360 

tgcctggttg gaaggtacag gtgttcagca ccttcggaaa aagggcataa agtngtgggg 420 

gacaattctc agtccaagaa gaatgcattg accattgctg gctatttgct tncctagtan 4 80 

gaattggatn catttttgac cangatnntt ctnctatgct ttnttgcaat gaaatcaaat 540 

cccgcattat ctacaagtgg tatgaagtcc tgcnnccccc agagaggctg ttcaggcnat 600 

gtcttccaag ggcagggtgg gttacaccat tttacctccc ctctcccccc agattatgna 660 

cncagaagga atttntttcc tccc 684 

<210> 35 

<211> 614 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 17, 20, 152, 223, 267, 287, 304, 306, 316, 319, 321, 355, 

365, 382, 391, 407, 419, 428, 434, 464, 467, 477, 480, 495, 

499 f 505, 515, 516, 522, 524, 527, 542, 547, 549, 567, 572, 
576, 578 

<223> n = A,T,C or G 
<400> 35 

actagtccaa cgcgttngcn aatattcccc tggtagccta cttccttacc cccgaatatt 60 

ggtaagatcg agcaatggct tcaggacatg ggttctcttc tcctgtgatc attcaagtgc 120 

tcactgcatg aagactggct tgtctcagtg tntcaacctc accagggctg tctcttggtc 180 

cacacctcgc tccctgttag tgccgtatga cagcccccat canatgacct tggccaagtc 24 0 

acggtttctc tgtggtcaat gttggtnggc tgattggtgg aaagtanggt ggaccaaagg 300 

aagncncgtg agcagncanc nccagttctg caccagcagc gcctccgtcc tactngggtg 360 

ttccngtttc tcctggccct gngtgggcta nggcctgatt cgggaanatg cctttgcang 420 

gaaggganga taantgggat ctaccaattg attctggcaa aacnatntct aagattnttn 480 

tgctttatgt ggganacana tctanctctc atttnntgct gnanatnaca ccctactcgt 540 

gntcgancnc gtcttcgatt ttcgganaca cnccantnaa tactggcgtt ctgttgttaa 600 
aaaaaaaaaa aaaa 614 

<210> 36 

<211> 686 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 222, 224, 237, 264, 285, 548, 551, 628, 643, 645, 665, 674 
<223> n - A,T,C or G 



<400> 36 

gtggctggcc cggttctccg cttctcccca 
ctccctcgtc gactgttgct tgctggtcgc 
taacctcggt gccaccggat tgcccttctt 
gggcgggggc ctggagcagc ccgaggcact 
ctcagctcgc cagtccggtc gctngcttcc 
acctgctctg ggcacacgcg acccgtggtt 
ggtatttctt aatcagcgct tgcaaagatg 



tcccctactt tcctccctcc ctccctttcc 60 
agactccctg acccctccct cacccctccc 120 
ttcctgttgc ccagcccagc cctagtgtca 180 
gcagcagaag ananaaaaga cacgacnaac 240 
cgccgcatgg caatnagaca gacgccgctc 300 
gatttggcct tcagtggcat cacccttatg 360 
gttaacctat gctacgccag ggagatacag 420 
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gagactggat tggaacattt ttggggtcta aaggtctgtt tggggtgcaa cactgaataa 4 80 

ggatgccacc aaagcagcta cagcagctgc agatttcaca gcccaagtgt gggatgctgt 54 0 

ctcagganat naattgataa cctggctcat aacacattgt caagaatgtg gatttcccca 600 

ggatattatt atttgtttac cggggganag gataactgtt tcncntattt taattgaaca 660 

aactnaaaca aaanctaagg aaatcc 686 

<210> 37 

<211> 681 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 7, 10, 11, 19, 25, 32, 46, 53, 77, 93, 101, 103, 109, 115, 
123, 128, 139, 157, 175, 180, 192, 193, 194, 212, 218, 226, 
227, 233, 240, 241, 259, 260, 267, 289, 296, 297, 298, 312, 
313, 314, 320, 325, 330, 337, 345, 346, 352, 353, 356 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 382, 385, 400, 427, 481, 484, 485, 491, 505, 515, 533, 542, 
544, 554, 557, 560, 561, 564, 575, 583, 589, 595, 607, 619, 
628, 634, 641, 645, 658, 670 
<223> n = A,T,C or G 



<400> 37 

gagacanacn naacgtcang agaanaaaag angcatggaa cacaanccag gcncgatggc 60 
caccttccca ccagcancca gcgcccccca gcngccccca ngnccggang accangactc 120 
cancctgnat caatctganc tctattcctg gcccatncct acctcggagg tggangccgn 180 
aaaggtcgca cnnncagaga agctgctgcc ancaccancc gccccnnccc tgncgggctn 240 
nataggaaac tggtgaccnn gctgcanaat tcatacagga gcacgcgang ggcacnnnct 300 
cacactgagt tnnngatgan gcctnaccan ggacctnccc cagcnnattg annacnggac 360 
tgcggaggaa ggaagacccc gnacnggatc ctggccggcn tgccaccccc ccacccctag 420 
gattatnccc cttgactgag tctctgaggg gctacccgaa cccgcctcca ttccctacca 480 
natnntgctc natcgggact gacangctgg ggatnggagg ggctatcocc cancatcccc 54 0 
tnanaccaac agcnacngan natnggggct ccccngggtc ggngcaacnc tcctncaccc 600 
cggcgcnggc cttcggtgnt gtcctccntc aacnaattcc naaanggcgg gccccccngt 660 
ggactcctcn ttgttccctc c 681 

<210> 38 

<211> 687 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature ■ 

<222> 3, 30, 132, 151, 203, 226, 228, 233, 252, 264, 279, 306, 
308, 320, 340, 347, 380, 407, 429, 437, 440, 445, 448, 491, 
559, 567, 586, 589, 593, 596, 603, 605, 606, 609, 626, 639, 
655, 674, 682 
<223> n = A, T,C or G 



<400> 38 

canaaaaaaa aaaacatggc cgaaaccagn 
ctcccggcct gtgtccggaa ggtttccctc 
gagggcggga cntgccgggg ccggagctca 
atcgcaaggg cggcgctaac ctnaggcctc 
gggTOCtgtg anaaccgcaa aaanaacgct 



aagctgcgcg atggcgccac ggcccctctt 60 
cgaggcgccc cggctcccgc aagcggagga 120 
naggccctgg ggccgctctg ctctcccgcc 180 
cccgcaaagg tccccnangc ggnggcggcg 240 
gggcgcgcng cgaacccgtc cacccccgcg 300 
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aaggananac ttccacagan gcagcgtttc cacagcccan agccacnttt ctagggtgat 360 

gcaccccagt aagttcctgn cggggaagct caccgctgtc aaaaaanctc ttcgctccac 420 

cggcgcacna aggggangan ggcangangc tgccgcccgc acaggtcatc tgatcacgtc 480 

gcccgcccta ntctgctttt gtgaatctcc actttgttca accccacccg ccgttctctc 540 

ctccttgcgc cttcctctna ccttaanaac cagcttcctc tacccnatng tanttnctct 600 

gcncnngtng aaattaattc ggtccnccgg aacctcttnc ctgtggcaac tgctnaaaga 660 

aactgctgtt ctgnttactg cngtccc 687 

<210> 39 

<211> 695 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 300, 401, 423, 429, 431, 437, 443, 448, 454, 466, 492, 515, 
523, 524, 536, 538, 541, 552, 561, 566, 581, 583, 619, 635, 
636, 641, 649, 661, 694 
<223> n * A,T,C or G 

<400> 39 

actagtctgg cctacaatag tgtgattcat gtaggacttc tttcatcaat tcaaaaccc'c 

tagaaaaacg tatacagatt atataagtag ggataagatt tctaacattt ctgggctctc 

tgacccctgc gctagactgt ggaaagggag tattattata gtatacaaca ctgctgttgc 

cttattagtt ataacatgat aggtgctgaa ttgtgattca caatttaaaa acactgtaat 

ccaaactttt ttttttaact gtagatcatg catgtgaatg ttaatgttaa tttgttcaan 

gttgttatgg gtagaaaaaa ccacatgcct taaaatttta aaaagcaggg cccaaactta 

ttagtttaaa attaggggta tgtttccagt ttgttattaa ntggttatag ctctgtttag 

aanaaatcna ngaacangat ttngaaantt aagntgacat tatttnccag tgacttgtta 

atttgaaatc anacacggca ccttccgttt tggtnctatt ggnntttgaa tccaancngg 

ntccaaatct tnttggaaac ngtccnttta acttttttac nanatcttat ttttttattt 

tggaatggcc ctatttaang ttaaaagggg ggggnnccac naccattcnt gaataaaact 

naatatatat ccttggtccc ccaaaattta aggng 

<210> 40 
<211> 674 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__ feature 

<222> 403, 428, 432, 507, 530, 543, 580, 583, 591, 604, 608, 621, 
624, 626, 639, 672 
<223> n - A,T,C or G 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

695 



<400> 40 

actagtagtc agttgggagt ggttgctata 
tattaaataa tagaaaagaa aatcccggtg 
ttacagaaaa tatagccatg attgaaatca 
tcttagctca tcttaaataa gtagtacact 
gttgtaacaa tagcacaaat cgaacttagg 
tgatcaattc tttaattttg ggaacctata 
ttaaatggat cactgatatt taagtcattc 
attagganaa antacctccc agcacagccc 
tggaatgagt ctcctttatt tccgaantgt 
tgnttgggtt gggtattaat ttgaactgtg 
aaantttncc ggttaatttg nctngncaaa 
atttgctatt cngg 



ccttgacttc atttatatga atttccactt 60 
cttgcagtag agttatagga cattctatgc 120 
aatagtaaag gctgttctgg ctttttatct 180 
tgggatgcag tgcgtctgaa gtgctaatca 24 0 
atgtgtttct tctcttctgt gtttcgattt 300 
atacagtttt cctattcttg gagataaaaa 360 
tgcttctcat ctnaatattc catattctgt 420 
cctctcaaac cccacccaaa accaagcatt 480 
ggatggtata acccatatcn ctccaatttc 54 0 
catgaaaagn ggnaatcttt nctttgggtc 600 
tccaatttnc tttaagggtg tctttataaa 660 

674 
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<210> 41 

<211> 657 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 243, 247, 251, 261, 267, 272, 298, 312, 315, 421, 432, 434, 
501, 524, 569, 594, 607, 650 
<223> n = A,T,C or G 

<400> 41 

gaaacatgca agtaccacac actgtttgaa ttttgcacaa aaagtgactg tagggatcag 60 
gtgatagccc cggaatgtac agtgtcttgg tgcaccaaga tgccttctaa aggctgacat 120 
accttgggac cctaatgggg cagagagtat agccctagcc cagtggtgac atgaccactc 180 
cctttgggag gctgaagtta aagggaatgg tatgtgtttt ctcatggaag cagcacatga 240 
atnggtnaca ngatgttaaa ntaaggntct antttgggtg tcttgtcatt tgaaaaantg 300 
acacactcct ancanctggt aaaggggtgc tggaagccat ggaagaactc taaaaacatt 360 
agcatgggct gatctgatta cttcctggca tcccgctcac ttttatggga agtcttatta 420 
naaggatggg ananttttcc atatccttgc tgttggaact ctggaacact ctctaaattt 4 80 
ccctctatta aaaatcactg nccttactac acttcctcct tganggaata gaaatggacc 540 
tttctctgac ttagttcttg gcatggganc cagcccaaat taaaatctga cttntccggt 600 
ttctccngaa ctcacctact tgaattggta aaacctcctt tggaattagn aaaaacc 657 

<210> 42 

<211> 389 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 179, 317, 320 
<223> n = A,T,C or G 

<400> 42 

actagtgctg aggaatgtaa acaagtttgc tgggccttgc gagacttcac caggttgttt 60 
cgatagctca cactcctgca ctgtgcctgt cacccaggaa tgtctttttt aattagaaga 120 
caggaagaaa acaaaaacca gactgtgtcc cacaatcaga aacctccgtt gtggcagang 180 
ggccttcacc gccaccaggg tgtcccgcca gacagggaga gactccagcc ttctgaggcc 240 
atcctgaaga attcctgttt gggggttgtg aaggaaaatc acccggattt aaaaagatgc 300 
tgttgcctgc ccgcgtngtn gggaagggac tggtttcctg gtgaatttct taaaagaaaa 360 
atattttaag ttaagaaaaa aaaaaaaaa 389 

<210> 43 

<211> 279 

<212> DNA 

<213> Homo sapiens 

<400> 43 

actagtgaca agctcctggt cttgagatgt cttctcgtta aggagatggg ccttttggag 60 
gtaaaggata aaatgaatga gttctgtcat gattcactat tctagaactt gcatgacctt 120 
tactgtgtta gctctttgaa tgttcttgaa attttagact ttctttgtaa acaaataata 180 
tgtccttatc attgtataaa agctgttatg tgcaacagtg tggagatcct tgtctgattt 240 
aataaaatac ttaaacactg aaaaaaaaaa aaaaaaaaa 279 



<210> 44 
<211> 449 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 245, 256, 264, 266, 273, 281, 323, 325, 337, 393 
<223> n = A,T,C or G 

<400> 44 

actagtagca tcttttctac aacgttaaaa ttgcagaagt agcttatcat taaaaaacaa 60 
caacaacaac aataacaata aatcctaagt gtaaatcagt tattctaccc cctaccaagg 120 
atatcagcct gttttttccc ttttttctcc tgggaataat tgtgggcttc ttcccaaatt 180 
tctacagcct ctttcctctt ctcatgcttg agcttccctg tttgcacgca tgcgttgtgc 240 
aagantgggc tgtttngctt ggantncggt ccnagtggaa ncatgctttc ccttgttact 300 
gttggaagaa actcaaacct tcnancccta ggtgttncca ttttgtcaag tcatcactgt 360 
atttttgtac tggcattaac aaaaaaagaa atnaaatatt gttccattaa actttaataa 420 
aactttaaaa gggaaaaaaa aaaaaaaaa 449 

<210> 45 

<211> 559 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 263 

<223> n = A, T,C or G 
<400> 45 

actagtgtgg gggaatcacg gacacttaaa gtcaatctgc gaaataattc ttttattaca 60 

cactcactga agtttttgag tcccagagag ccattctatg tcaaacattc caagtactct 120 

ttgagagccc agcattacat caacatgccc gtgcagttca aaccgaagtc cgcaggcaaa 180 

tttgaagctt tgcttgtcat tcaaacagat gaaggcaaga gtattgctat tcgactaatt 240 

ggtgaagctc ttggaaaaaa ttnactagaa tactttttgt gttaagttaa ttacataagt 300 

tgtattttgt taactttatc tttctacact acaattatgc ttttgtatat atattttgta 360 

tgatggatat ctataattgt agattttgtt tttacaagct aatactgaag actcgactga 420 

aatattatgt atctagccca tagtattgta cttaactttt acagggtgaa aaaaaaattc 480 

tgtgtttgca ttgattatga tattctgaat aaatatggga atatatttta atgtgggtaa 540 
aaaaaaaaaa aaaaaggaa 559 

<210> 46 

<211> 731 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 270, 467, 477, 502, 635, 660, 671, 688, 695, 697, 725 
<223> n = A,T,C or G 

<400> 46 

actagttcta gtaccatggc tgtcatagat gcaaccatta tattccattt agtttcttcc 60 
tcaggttccc taacaattgt ttgaaactga atatatatgt ttatgtatgt gtgtgtgttc 120 
actgtcatgt atatggtgta tatgggatgt gtgcagtttt cagttatata tatattcata 180 
tatacatatg catatatatg tataatatac atatatacat gcatacactt gtataatata 240 
catatatata cacatatatg cacacatatn atcactgagt tccaaagtga gtctttattt 300 
ggggcaattg tattctctcc ctctgtctgc tcactgggcc tttgcaagac atagcaattg 360 
cttgatttcc tttggataag agtcttatct tcggcactct tgactctagc cttaacttta 420 
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gatttctatt ccagaatacc tctcatatct atcttaaaac ctaaganggg taaagangtc 480 
ataagattgt agtatgaaag antttgctta gttaaattat atctcaggaa actcattcat 540 
ctacaaatta aattgtaaaa tgatggtttg ttgtatctga aaaaatgttt agaacaagaa 600 
atgtaactgg gtacctgtta tatcaaagaa cctcnattta ttaagtctcc tcatagccan 660 
atccttatat ngccctctct gacctgantt aatananact tgaataatga atagttaatt 720 
taggnttggg c 731 

<210> 47' 

<211> 640 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 5, 28, 106, 153, 158, 173, 176, 182, 189, 205, 210, 214, 
225, 226, 229, 237, 260, 263, 269, 277, 281, 282, 322, 337, 
338, 354, 365, 428, 441, 443, 456, 467, 476, 484, 503, 508, 
554, 567, 575, 579, 588, 601, 606, 609, 611, 621, 636 
<223> n — A,T,C or G 



<400> 47 

tgcgngccgg tttggccctt ctttgtanga 
cgttaataac tcctcaggtc cctgcctgca 
gtacaccaaa tgtgacatcc tttcaccaat 
anacgactnc aacaattttt tgatnacccn 
ggagcagcat ggacctgtcn gcnactaang 
ttggtatgtc ttactgaaag anagaaacat 
caganattgc caatgccaag tccgagcggt 
tacatacntt gtccccgaaa nanaagatgc 
acanctacac ctggtgcttg ganaacanac 
cccagtgggt tttnccttgg cacctanctt 
ntggcnttnt nttgggacca ntcttctcac 

<210> 48 

<211> 257 

<212> DNA 

<213> Homo sapiens 



cactttcatc cgccctgaaa tcttcccgat 60 
cagggttttt tcttantttg ttgcctaaca 120 
atngattnct tcataccaca tcntcnatgg 180 
aaanactggg ggctnnaana agtacantct 240 
gaacaanagt nntgaacatt tacacaacct 300 
gcttctnncc ctagaccacg aggncaaccg 360 
tagatcaggt aatacattcc atggatgcat 420 
cctaanggct tcttcanact ggtccngaaa 480 
tctttggaag atcatctggc acaagttccc 540 
accanatcna ttcggaancc attctttgcc 600 
aactgnaccc 64 0 



<400> 48 

actagtatat gaaaatgtaa 
ccaccttgag cagccttgga 
tgattttctt tgttcctgaa 
ttatatttgt atatgtatca 
aaaaaaaaaa aaaaaaa 



atatcacttg tgtactcaaa 
aacctaacct gcctctttta 
aaagtgattt gtattagttt 
tcataaaata tttaaataaa 



caaaagttgg tcttaagctt 60 
gcataatcac attttctaaa 120 
tacatttgtt ttttggaaga 180 
aagtatcttt agagtgaaaa 240 

257 



<210> 49 

<211> 652 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<222> 410, 428, 496, 571, 647 

<223> n = A, T, C or G 

<400> 49 

actagttcag atgagtggct gctgaagggg 
tccacttatt tgaactctta agtcataaat 



cccccttgtc attttcatta taacccaatt 60 
gtataatgac ttatgaatta gcacagttaa 120 
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gttgacacta gaaactgccc atttctgtat tacactatca aataggaaac attggaaaga 180 

tggggaaaaa aatcttattt taaaatggct tagaaagttt tcagattact ttgaaaattc 240 

taaacttctt tctgtttcca aaacttgaaa atatgtagat ggactcatgc attaagactg 300 

ttttcaaagc tttcctcaca tttttaaagt gtgattttcc ttttaatata catatttatt 360 

ttctttaaag cagctatatc ccaacccatg actttggaga tatacctatn aaaccaatat 420 

aacagcangg ttattgaagc agctttctca aatgttgctt cagatgtgca agttgcaaat 4 80 

tttattgtat ttgtanaata caatttttgt tttaaactgt atttcaatct atttctccaa 54 0 

gatgcttttc atatagagtg aaatatccca ngataactgc ttctgtgtcg tcgcatttga 600 

cgcataactg cacaaatgaa cagtgtatac ctcttggttg tgcattnacc cc 652 

<210> 50 
<211> 650 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 237, 270, 311, 443, 454, 488, 520, 535, 539, 556, 567, 594, 
603, 634 

<223> n = A, T,C or G 
<400> 50 

ttgcgctttg atttttttag ggcttgtgcc ctgtttcact tatagggtct agaatgcttg 60 
tgttgagtaa aaaggagatg cccaatattc aaagctgcta aatgttctct ttgccataaa 120 
gactccgtgt aactgtgtga acacttggga tttttctcct ctgtcccgag gtcgtcgtct 180 
gctttctttt ttgggttctt tctagaagat tgagaaatgc atatgacagg ctgagancac 240 
ctccccaaac acacaagctc tcagccacan gcagcttctc cacagcccca gcttcgcaca 300 
ggctcctgga nggctgcctg ggggaggcag acatgggagt gccaaggtgg ccagatggtt 360 
ccaggactac aatgtcttta tttttaactg tttgccactg ctgccctcac ccctgcccgg 420 
ctctggagta ccgtctgccc canacaagtg ggantgaaat gggggtgggg gggaacactg 480 
attcccantt agggggtgcc taactgaaca gtagggatan aaggtgtgaa cctgngaant 54 0 
gcttttataa attatnttcc ttgttanatt tattttttaa tttaatctct gttnaactgc 600 
ccngggaaaa ggggaaaaaa aaaaaaaaat tctntttaaa cacatgaaca 650 

<210> 51 
<211> 545 
<212> DNA 
■ <213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 66, 159, 195, 205, 214, 243, 278, 298, 306, 337, 366, 375, 
382, 405, 446, 477, 492, 495, 503, 507, 508, 521, 537 
<223> n - A,T,C or G 

<400> 51 

tggcgtgcaa ccagggtagc tgaagtttgg gtctgggact ggagattggc cattaggcct 60 
cctganattc cagctccctt ccaccaagcc cagtcttgct acgtggcaca gggcaaacct 120 
gactcccttt gggcctcagt ttcccctccc cttcatgana tgaaaagaat actacttttt 180 
cttgttggtc taacnttgct ggacncaaag tgtngtcatt attgttgtat tgggtgatgt 240 
gtncaaaact gcagaagctc actgcctatg agaggaanta agagagatag tggatganag 300 
ggacanaagg agtcattatt tggtatagat ccacccntcc caacctttct ctcctca'gtc 360 
cctgcncctc atgtntctgg tntggtgagt cctttgtgcc accanccatc atgctttgca 420 
ttgctgccat cctgggaagg gggtgnatcg tctcacaact tgttgtcatc gtttganatg 4 80 
catgctttct tnatnaaaca aanaaannaa tgtttgacag ngfettaaaat aaaaaanaaa 54 0 
caaaa 545 



<210> 52 



WO 02/47534 



22 



PCT/US01/47576 



<211> 678 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_£eature 

<222> 98, 119, 121, 131, 136, 139, 140, 142, 143, 163, 168, 172, 
176, 184, 189, 190, 191, .200, 201, 205, 207, 221, 223, 229, 
230, 237, 240, 241, 255, 264, 266, 267, 276, 280, 288, 289, 
291, 297, 301, 306, 308, 314, 315, 326, 332, 335, 337 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 339, 341, 343, 344, 345, 347, 350, 355, 356, 358, 362, 363, 
372, 379, 395, 397, 398, 400, 403, 412, 414, 421, 423, 431, 
435, 438, 439, 450, 457, 463, 467, 471, 474, 480, 483, 484, 
487, 490, 491, 492, 493, 499, 500, 504, 508, 518, 536 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 538, 549, 551, 552, 554, 556, 557, 562, 563, 567, 571, 572, 
576, 579, 590, 592, 595, 598, 606, 609, 613, 620, 622, 624, 
626, 631, 634, 638, 641, 647, 654, 660, 661, 674 
<223> n = A,T,C or G * 



<400> 52 

actagtagaa gaactttgcc gcttttgtgc 
ggaggaagac gatttggggg gggagggggg 
ntatctccat ntccantgnn cnntgtcgcc 
tggnccccnn nccctctccn ncctncncct 
ncttccccat ctccntcccc cctnanngtc 
nctccncncc tccnnccgtt cttctnttct 
annctctccc cnctgcaanc gattctctcc 
ncncgctcct nttcntcnnc ccacctctcn 
cgnntcnttn nnntcctcnn accncccncc 
tctctcccnc nncncnncct cnncccntcc 
ccttcntcnc cantccatcn cntntnccat 
ntctctttca cacngtcc 

<210> 53 
<211> 502 
<212> DNA 

<213> Homo sapiens 



ctctcacagg cgcctaaagt cattgccatg 60 
gggggcangg tccgtggggc tttccctant 120 
tcttccctcg tcncattnga anttantccc 180 
cccccctccg ncncctccnn ctttttntan 240 
ccaacnccgn cagcaatnnc ncacttnctc 300 
cnacntntnc ncnnntnccn tgccnntnaa 360 
ctcchcnnan ctntccactc cntncttctc 420 
ccttcgnccc cantacnctc nccncccttn 480 
tcccttcncc cctcttctcc ccggtntntc 540 
nngcgnccnt ttccgccccn cnccnccntt 600 
nctncctncc nctcacnccc gctncccccn 660 

678 



<220> 

<221> misc_feature 

<222> 139, 146, 215, 217, 257, 263, 289, 386, 420, 452, 457, 461, 

466, 482, 486 

<223> n = A, T,C or G 

<400> 53 

tgaagatcct ggtgtcgcca tgggccgccg ccccgcccgt tgttaccggt attgtaagaa 60 

caagccgtac ccaaagtctc gcttctgccg aggtgtccct gatgccaaaa ttcgcatttt 120 

tgacctgggg cggaaaaang caaaantgga tgagtctccg ctttgtggcc acatggtgtc 180 

agatcaatat gagcagctgt cctctgaagc cctgnangct gcccgaattt gtgccaataa 240 

gtacatggta aaaagtngtg gcnaagatgc ttccatatcc gggtgcggnt ccaccccttc 300 

cacgtcatcc gcatcaacaa gatgttgtcc tgtgctgggg ctgacaggct cccaacaggc 360 

atgcgaagtg cctttggaaa acccanggca ctgtggccag ggttcacatt gggccaattn 420 
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atcatgttca tccgcaccaa ctgcagaaca angaacntgt naattnaagc cctgcccagg 480 
gncaanttca aatttcccgg cc 502 

<210> 54 

<211> 494 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 431, 442, 445 
<223> n = A,T,C or G 

<400> 54 

actagtccaa gaaaaatatg cttaatgtat attacaaagg ctttgtatat gttaacctgt 60 

tttaatgcca aaagtttgct ttgtccacaa tttccttaag acctcttcag aaagggattt 120 

gtttgcctta atgaatactg ttgggaaaaa acacagtata atgagtgaaa agggcagaag 180 

caagaaattt ctacatctta gcgactccaa gaagaatgag tatccacatt tagatggcac 240 

attatgagga ctttaatctt tccttaaaca caataatgtt ttcttttttc ttttattcac 300 

atgatttcta agtatatttt tcatgcagga cagtttttca accttgatgt acagtgactg 360 

tgttaaattt ttctttcagt ggcaacctct ataatcttta aaatatggtg agcatcttgt 420 

ctgttttgaa ngggatatga cnatnaatct atcagatggg aaatcctgtt tccaagttag 480 

aaaaaaaaaa aaaa 494 

<210> 55 

<211> 606 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 375, 395, 511, 542, 559, 569, 578, 581 
<223> n «■ A,T,C or G 

<400> 55 

actagtaaaa agcagcattg ccaaataatc cctaattttc cactaaaaat ataatgaaat 60 

gatgttaagc tttttgaaaa gtttaggtta aacctactgt tgttagatta atgtatttgt 120 

tgcttccctt tatctggaat gtggcattag cttttttatt ttaaccctct ttaattctta 180 

ttcaattcca tgacttaagg ttggagagct aaacactggg atttttggat aacagactga 240 

cagttttgca taattataat cggcattgta catagaaagg atatggctac cttttgttaa 300 

atctgcactt tctaaatatc aaaaaaggga aatgaagtat aaatcaattt ttgtataatc 360 

tgtttgaaac atgantttta tttgcttaat attanggctt tgcccttttc tgttagtctc 420 

ttgggatcct gtgtaaaact gttctcatta aacaccaaac agttaagtcc attctctggt 4 80 

actagctaca aattccgttt catattctac ntaacaattt aaattaactg aaatatttct 540 

anatggtcta cttctgtcnt ataaaaacna aacttgantt nccaaaaaaa aaaaaaaaaa 600 
aaaaaa 606 

<210> 56 

<211> 183 

<212> DNA 

<213> Homo sapiens 

<400> 56 

actagtatat ttaaacttac aggcttattt gtaatgtaaa ccaccatttt aatgtactgt 60 
aattaacatg gttataatac gtacaatcct tccctcatcc catcacacaa ctttttttgt 120 
gtgtgataaa ctga.ttttgg tttgcaataa aaccttgaaa aataaaaaaa aaaaaaaaaa 180 
aaa 183 
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<210> 57 

<211> 622 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 358, 368, 412, 414, 425, 430, 453, 455, 469, 475, 495, 499, 
529, 540, 564, 575, 590 
<223> n = A, T,C or G 



<400> 57 

actagtcact actgtcttct ccttgtagct aatcaatcaa tattcttccc ttgcctgtgg 60 

gcagtggaga gtgctgctgg gtgtacgctg cacctgccca ctgagttggg gaaagaggat 120 

aatcagtgag cactgttctg ctcagagctc ctgatctacc ccacccccta ggatccagga 180 

ctgggtcaaa gctgcatgaa accaggccct ggcagcaacc tgggaatggc tggaggtggg 24 0 

agagaacctg acttctcttt ccctctccct cctccaacat tactggaact ctatcctgtt 300 

agggatcttc tgagcttgtt tccctgctgg gtgggacaga agacaaagga gaagggangg 360 

tctacaanaa gcagcccttc tttgtcctct ggggttaatg agcttgacct ananttcatg 420 

gaganaccan aagcctctga tttttaattt ccntnaaatg tttgaagtnt atatntacat 480 

atatatattt ctttnaatnt ttgagtcttt gatatgtctt aaaatccant ccctctgccn 540 

gaaacctgaa ttaaaaccat gaanaaaaat gtttncctta aagatgttan taattaattg 600 

aaacttgaaa aaaaaaaaaa aa 622 

<210> 58 

<211> 433 

<212> DNA 

<213> Homo sapiens 

<400> 58 

gaacaaattc tgattggtta tgtaccgtca aaagacttga agaaatttca tgattttgca 60 

gtgtggaagc gttgaaaatt gaaagttact gcttttccac ttgctcatat agtaaaggga 120 

tcctttcagc tgccagtgtt gaataatgta tcatccagag tgatgttatc tgtgacagtc 180 

accagcttta agctgaacca ttttatgaat accaaataaa tagacctctt gtactgaaaa 240 

catatttgtg actttaatcg tgctgcttgg atagaaatat ttttactggt tcttctgaat 300 

tgacagtaaa cctgtccatt atgaatggcc tactgttcta ttatttgttt tgacttgaat 360 

ttatccacca aagacttcat ttgtgtatca tcaataaagt tgtatgtttc aactgaaaaa 420 
aaaaaaaaaa aaa 433 

<210> 59 

<211> 649 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 22, 190, 217, 430, 433, 484, 544, 550, 577, 583, 594 
<223> n = A,T,C or G 



<4O0> 59 

actagttatt atctgacttt cnggttataa 
tgtcatttgg atttgcattt ctctgatgag 
ttggccatat gtgtatgttc cctggagaag 
attaggcgtn tgtcttttta ttactgagtt 
gacccttatc agatacatgg tttgcaaata 
ctttatcgat aatgtcctta gacatataat 
ggctgtgcaa ggtgggctca cgcttgtaat 
atcatatgan gangctagga gttcgaggtc 



tcattctaat gagtgtgaag tagcctctgg 60 
tgatgctatc aagcaccttt gctggtgctg 120 
tgtctgtgct gagccttggc ccacttttta 180 
gtaaganttc tttatatatt ctggattcta 240 
ttttctccca ttctgtgggt tgtgttttca 300 
aaatttgtat tttaaaagtg acttgatttg 360 
cccagcactt tgggagactg aggtgggtgg 420 
agcctggcca gcatagcgaa aacttgtctc 480 
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tacnaaaaat acaaaaatta gtcaggcatg gtggtgcacg tctgtaatac cagcttctca 54 0 
ggangctgan gcacaaggat cacttgaacc ccagaangaa gangttgcag tganctgaag 600 
atcatgccag ggcaacaaaa atgagaactt gtttaaaaaa aaaaaaaaa 64 9 

<210> 60 

<211> 423 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 209, 222, 277, 389, 398 

<223> n = A,T,C or G 

<400> 60 

actagttcag gccttccagt tcactgacaa acatggggaa gtgtgcccag ctggctggaa 60 
acctggcagt gataccatca agcctgatgt ccaaaagagc aaagaatatt tctccaagca 120 
gaagtgagcg ctgggctgtt ttagtgccag gctgcggtgg gcagccatga gaacaaaacc 180 
tcttctgtat tttttttttc cattagtana acacaagact cngattcagc cgaattgtgg 240 
tgtcttacaa ggcagggctt tcctacaggg ggtgganaaa acagcctttc ttcctttggt 300 
aggaatggcc tgagttggcg ttgtgggcag gctactggtt tgtatgatgt attagtagag 360 
caacccatta atcttttgta gtttgtatna aacttganct gagaccttaa acaaaaaaaa 420 



<210> 61 

<211> 423 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 195, 285, 295, 329, 335, 340, 347, 367, 382, 383, 391, 396, 
418 

<223> n A,T,C or G 



<400> 61 

cgggactgga atgtaaagtg aagttcggag ctctgagcac gggctcttcc cgccgggtcc 60 

tccctcccca gaccccagag ggagaggccc accccgccca gccccgcccc agcccctgct 120 

caggtctgag tatggctggg agtcgggggc cacaggcctc tagctgtgct gctcaagaag 180 

actggatcag ggtanctaca agtggccggg ccttgccttt gggattctac cctgttccta 240 

atttggtgtt ggggtgcggg gtccctggcc cccttttcca cactncctcc ctccngacag 300 

caacctccct tggggcaatt gggcctggnt ctccncccgn tgttgcnacc ctttgttggt 360 

ttaaggnctt taaaaatgtt annttttccc ntgccngggt taaaaaagga aaaaactnaa 420 



<210> 62 

<211> 683 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 218, 291, 305, 411, 416, 441, 443, 453, 522, 523, 536, 542, 
547, 566, 588, 592, 595, 603, 621, 628, 630, 632, 644, 645, 
648, 655, 660, 672, 674, 676, 677, 683 
<223> n - A,T,C or G 



<400> 62 
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gctggagagg ggtacggact 
gaagagaccc taagagactg 
gctgtcaaca cttaaaggaa 
tggggccact ggccatggtc 
ggatcaaaat gtgtacttgt 
tgtcnttgga ctttcttccc 
ccctcctgtg tttttggaat 
atgaacttat gtttggggtc 
atttattttt gaaatatttt 
cnttttnttt ggggggggtg 
ttnttacttg gggcccccct 
ctaaaaaaaa ananannaaa 

<210> 63 . 
<211> 731 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 237, 249, 263, 288, 312, 317, 323, 326, 337, 352, 362, 370, 
377, 400, 411, 414, 434, 436, 446, 457, 473, 486, 497, 498, 
502, 512, 531, 546, 554, 563, 565, 566, 588, 597, 608, 611, 
613, 615, 627, 632, 640, 641, 644, 654, 660, 663, 665 
<223> n = A,T,C or G 

<221> misc feature 

<222> 671,~678, 692, 697, 698, 699, 704, 705, 712, 714, 717, 718, 
719, 723, 725, 730, 731 
<223> n = A, T,C or G 

<400> 63 

actagtcata aagggtgtgc gcgtcttcga cgtggcggtc ttggcgccac tgctgcgaga 60 

cccggccctg gacctcaagg tcatccactt ggtgcgtgat ccccgcgcgg tggcgagttc 120 

acggatccgc tcgcgccacg gcctcatccg tgagagccta caggtggtgc gcagccgaga 180 

ccgcgagctc accgcatgcc cttcttggag gccgcgggcc acaagcttgg cgcccanaaa 240 

gaaggcgtng ggggcccgca aantaccacg ctctgggcgc tatggaangt cctcttgcaa 300 

taatattggt tnaaaanctg canaanagcc cctgcanccc cctgaactgg gntgcagggc 360 

cncttacctn gtttggntgc ggttacaaag aacctgtttn ggaaaaccct nccnaaaacc 420 

ttccgggaaa attntncaaa tttttnttgg ggaattnttg ggtaaacccc ccnaaaatgg 480 

gaaacntttt tgccctnnaa antaaaccat tnggttccgg gggccccccc ncaaaaccct 540 

tttttntttt tttntgcccc cantnncccc ccggggcccc tttttttngg ggaaaanccc 600 

cccccctncc nanantttta aaagggnggg anaatttttn nttncccccc gggncccccn 660 

ggngntaaaa nggtttcncc cccccgaggg gnggggnnnc ctcnnaaacc cntntcnnna 720 

ccncnttttn n 731 

<210> 64 

<211> 313 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 240 

<223> n = A,T,C or G 
<400> 64 

actagttgtg caaaccacga ctgaagaaag acgaaaagtg ggaaataact tgcaacgtct 60 



ttcttggagt tgtcccaggt 
gggaatggtt cctgccttca 
gtccccttga agcccagagt 
cgtggacaag acattccngt 
ggggtctcgc cccttgccaa 
attccctcct ccccaaatgc 
tctgtttccc tcaaaattgt 
nangttcccc ttnccaatgc 
ttaatgaact tggaaaaaat 
gggggntggg ttaaaatttt 
naaaaaantn anttccaatt 
aan 



tggaatgaga ctgaactcaa 60 
ggaaagtgaa agacgcttag 120 
ggacagacta gacccattga 180 
gggccatggc acaccggggg 240 
aaccaaacca ntcccactcc 300 
acttcccctc ctccctctgc 360 
taatttttta nttttngacc 420 
atactaatat attaatggtt 480 
tnntggaatt tccttncttc 540 
tttggaancc cnatnggaaa 600 
cttnnatngc ccctnttccn 660 

683 
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gttagagatg gttgctacac atgttgggtc tgtagagaaa catcttgagg agcagattgc 120 

taaagttgat agagaatatg aagaatgcat gtcagaagat ctctcggaaa atattaaaga 180 

gattagagat aagtatgaga agaaagctac tctaattaag tcttctgaag aatgaagatn 240 

aaatgttgat catgtatata tatccatagt gaataaaatt gtctcagtaa agttgtaaaa 300 

aaaaaaaaaa aaa 313 

<210> 65 

<211> 420 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 400, 402, 403, 404, 405, 406, 409, 411, 412, 414, 415, 416 
<223> n « A,T,C or G 

<400> 65 

actagttccc tggcaggcaa gggcttccaa ctgaggcagt gcatgtgtgg cagagagagg 60 
caggaagctg gcagtggcag cttctgtgtc tagggagggg tgtggctccc tccttccctg 120 
tctgggaggt tggagggaag aatctaggcc ttagcttgcc ctcctgccac ccttcccctt 180 
gtagatactg ccttaacact ccctcctctc tcagctgtgg ctgccaccca agccaggttt 240 
ctccgtgctc actaatttat ttccaggaaa ggtgtgtgga agacatgagc cgtgtataat 300 
atttgtttta acattttcat tgcaagtatt gaccatcatc cttggttgtg tatcgttgta 360 
acacaaatta atgatattaa aaagcatcca aacaaagccn annnnnaana nnannngaaa 420 



<210> 66 

<211> 676 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 328, 454, 505, 555, 586, 612, 636, 641 
<223> n » A, T,C or G 

<400> 66 

actagtttcc tatgatcatt aaactcattc tcagggttaa gaaaggaatg taaatttctg 60 

cctcaatttg tacttcatca ataagttttt gaagagtgca gatttttagt caggtcttaa 120 

aaataaactc acaaatctgg atgcatttct aaattctgca aatgtttcct ggggtgactt 180 

aacaaggaat aatcccacaa tatacctagc tacctaatac atggagctgg ggctcaaccc 24 0 

actgttttta aggatttgcg cttacttgtg gctgaggaaa aataagtagt tccgagggaa 300 

gtagttttta aatgtgagct tatagatngg aaacagaata tcaacttaat tatggaaatt 360 

gttagaaacc tgttctcttg ttatctgaat cttgattgca attactattg tactggatag 420 

actccagccc attgcaaagt ctcagatatc ttanctgtgt agttgaattc cttggaaatt 480 

ctttttaaga aaaaattgga gtttnaaaga aataaacccc tttgttaaat gaagcttggc 54 0 

tttttggtga aaaanaatca tcccgcaggg cttattgttt aaaaanggaa ttttaagcct 600 

ccctggaaaa anttgttaat taaatgggga aaatgntggg naaaaattat ccgttagggt 660 

ttaaagggaa aactta 676 

<210> 67 

<211> 620 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 419, 493, 519, 568, 605, 610 
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<223> a » A,T,C or G 
<400> 67 

caccattaaa gctgcttacc aagaacttcc 
gaattgtgag caggtgatag aagagccttt 
acattccatt taatgaaggg gttacatctg 
taggggaaaa aaatctgatc agaacgcatc 
agattgtagt gctgtggtgg tttattccgt 
cccaaagaga ggaaattata ggttagttaa 
cacttttgaa gtgttttgtt ttttattttt 
ctaaaaaaaa agggatatca atctctaatt 
tctttactgg aanttatggg actttttaag 
ttgcatgggc cccttaaaat tgttgaangg 
ccccnttttn aaaatttgga 

<210> 68 

<211> 551 

<212> DNA 

<213> Homo sapiens 



ccagcatttt gacttccttg tttgatagct 60 
ctagttgaac atacagataa tttgctgaat 120 
ttacgaagct actaagaag.g agcaagagca 180 
aaactcacat gtgccccctc tactacaaac 240 
tgtgcagaac ttgcaagctg agtcactaaa 300 
acattgtaat cccaggaact aagtttaatt 360 
ggtttgtctg atttactttg ggggaaaang 420 
cagtgcccac taaaagttgt ccctaaaaag 480 
ctccaggtnt tttggtcctc caaattaacc 540 
cattcctgcc tctaagtttg gggaaaattc 600 

620 



<220> 

<221> misc_feature o 

<222> 286, 464, 480, 501, 502, 518, 528, 533, 536, 537, 538, 539, 
540, 541, 543, 544, 545, 547, 548, 549 
<223> n = A,T,C or G 



<400> 68 

actagtagct ggtacataat 
ctaatgctag accagtattt 
ttagaacaga cctctctgtg 
gtattggggt tgcaatgact 
tctgagactg tggtgaaact 
actcggcacc actttgatat 
tacagctgat ggaactcaat 
ttaaacctaa ttacatttgt 
cctatgtgct cccctccccc 
nannnannna a 



cactgaggag ctatttctta 
aagggctaat ctcacacctc 
caataacttg tggccactgg 
cccaagggcc aaaagagtta 
ccttccaagg ctgagggggt 
tcaacaagcc acttgaagcc 
ttgaaccttc aaaactttgt 
ctagcattgg atttggttcc 
nnatcttaat ttaaaccnca 



acatgctttt atagaccatg 60 
cttagctgta agagtctggc 120 
aaatccctgg gccggcattt 180 
aaggcacgac tgggatttct 240 
cagtangtgc tctgggaggg 300 
caattataaa attgttattt 360 
tagtttatcc tattatattg 420 
tgtngcatat gtttttttcn 480 
attttgcnat tcnccnnnnn 540 

551 



<210> 69 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> * 

<221> misc_feature 
<222> 235, 310, 323, 381 
<223> n = A,T,C or G 



<400> 69 

cagaaatgga aagcagagtt ttcatttctg 
gcagagtttt cattaaatcc ttttaccttt 
gtatgtggga tattgaatgt taaagggata 
aattaagcaa atgttaaaag ttttatatgc 
tgtgatacat tttttaagct tcagttgctt 
ggggagccan aaaccaatct acnatctctt 
aaaaataaat aaaaactatt nagaaattga 



tttataaacg tctccaaaca aaaatggaaa 60 
tttttttctt ggtaatcccc tcaaataaca 120 
tttttttcta ttatttttat aattgtacaa 180 
tttattaatg ttttcaaaag gtatnataca 240 
gtcttctggt actttctgtt atgggctttt 300 
tttgtttgcc aggacatgca ataaaattta 360 
aaaaaa 396 



<210> 70 



WO 02/47534 



29 



PCT7US01/47576 



<211> 536 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 388, 446, 455 
<223> n « A,T,C or G 



<400> 70 

actagtgcaa 

cttcgaaaga 

ggcgtgacag 

ccactacccc 

aacagatatt 

tctgtgactg 

tcatgtctgt 

ttatagtcca 

aattgtataa 



aagcaaatat 
cccctgtaaa 
gctggaagag 
gttttctctt 
tttgtttctc 
cttgctgact 
gacttcattt 
gttcttatca 
gaataaaagt 



aaacatcgaa 
agagcccaac 
caaatgctgc 
cttgctgcaa 
atcttaacta 
ttatcataat 
ttaaatgnta 
acattnaaac 
tagaatttaa 



aaggcgttcc 
agtgaaaatg 
tgagcattct 
aataaaccac 
tccaagccac 
tttcttcaaa 
cttgctcagc 
ctatngcaat 
caattaaaaa 



tcacgttagc 
tagatatcag 
cctgttccat 
tctgtccatt 
ctattttatt 
caaaaaaatg 
tcaactgcat 
catttcaaat 
aaaaaaaaaa 



tgaagatatc 60 
cagtggagga 120 
cagttgccat 180 
tttaactcta 240 
tgttctttca 300 
tatagaaaaa 360 
ttcagttgtt 420 
ctattctgca 480 
aaaaaa 536 



<210> 71 

<211> 865 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 22, 35, 39, 56, 131, 138, 146, 183, 194, 197, 238, 269, 277, 
282, 297, 316, 331, 336, 340, 341, 346, 349, 370, 376, 381, 
382, 392, 396, 397, 401, 433, 444, 445, 454, 455, 469, 472, 
477, 480, 482, 489, 497, 499, 511, 522, 526, 527 
<223> n - A,T,C or G 

<221> misc_feature 

<222> 545, 553, 556, 567, 574, 580, 610, 613, 634, 638, 639, 663, 
672, 689, 693, 694, 701, 704, 713, 723, 729, 732, 743, 744, 
749, 761, 765, 767, 769, 772, 774, 780, 783, 788, 792, 803, 
810, 824, 840, 848 
<223> n = A f T,C or G 



<400> 71 

gacaaagcgt 

cccaccagca 

ggattaatct 

tcncaccaag 

gaaactggtg 

cagagctgga 

gaagatggan 

attcccgctg 

tncaacatng 

acaanctctc 

cacgccaagn 

ganggttatc 

ccncctatnt 

ttngttgncc 

aagggtgntt 



taggagaaga 
accagcgccc 
nacctctntc 
aganaanctg 
accaatctgc 
tatgangcca 
gacccncgac 
aangaatctc 
ggattanang 
ccnaanaaac 
aantataaaa 
cnccttgcgt 
cnagccgaac 
cngccctttc 
ggccccctcc 



anagaggcag 
cccaccagcc 
gcctgnccca 
ctgccaacac 
agaattctna 
gaccatggac 
nngatcaggc 
tgannggctt 
ctgggaactg 
tggggcncct 

ggggggcccc 

accatggtnc 
tcnnatttnc 
cgncggaacn 
ctccc 



ggaanactnc 
cccaggcccg 
ttcctacctc 
caaccgcccc 
gaggaanaag 
nctacncccn 
cngctnncca 
ccannaaagc 
naaggggcaa 
catnggtggn 
tccncggnng 
ccnnttctgt 
ccgggggtgc 
cgtttccccg 



ccaggcacga 
gacgacgaag 
ggaggtggag 
agccctggcg 
cnaggggccc 
ncaatncana 
nccccccacc 
gcctccccnc 
ancctnnaat 
accaactatt 
accccctttt 
ntgnatgttt 
natcnantng 
ttantaacgg 



tggccncctt 60 
actccatcct 120 
gccggaaagg 180 
ggcacganag 240 
cgcgctnaga 300 
cgggactgcg 360 
cctatgaatt 420 
cnaacgnaan 480 
atccccagaa 540 
aactaaaccg 600 
gtcccttaat 660 
ccnctcccct 720 
tncncctttn 780 
cacccggggn 840 
865 



<210> 72 
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<211> 560 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 83, 173, 183, 186, 209, 211, 215, 255, 321, 322, 323, 335, 
344, 357, 361, 368, 394, 412, 415, 442, 455, 469/ 472, 475, 
487, 513, 522, 528, 531, 534, 546 
<223> n = A,T,C or G 



<400> 72 

cctggacttg 

aaaagacagt 

ccatgcccaa 

tcnaantgct 

cagcagtgga 

gcaccacaaa 

ngcctgtnaa 

cctgaaagga 

actgatnctt 

tttccntttc 



tcttggttcc 
gtccagtgct 
cttctctggc 
gggggtgaat 
gatcnaacag 
gattaacttc 
aacctggtga 
gaaggccccc 
gaaccctgaa 
cccaaaaaaa 



agaacctgac 
ccngcctagg 
aactggaaaa 
gtgatgctna 
gagggagaca 
nnngttgggg 
aatgggagaa 
anaactcctg 
cgggcgggat 



gacccggcga 
agtctacggg 
tcatccgatc 
ngaanattgc 
ctttctacat 
aggantttga 
tganaataaa 
gaccngaaaa 
ganccttttt 



cggcgacgtc 
gaccgcctcc 
ggaaaacttc 
tgtggctgca 
caaaacctcc 
ggancaaact 
atggtctgtg 
actgacccnc 
tnttgccncc 



tcttttgact 60 
cgcgccgcca 120 
gangaattgc 180 
gcgtccaagc 240 
accaccgtgc 300 
gtggatngga 360 
ancanaaact 420 
cnatngggga 480 
naangggttc 540 
560 



<210> 73 

<211> 379 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 8, 17, 18, 21, 26, 29, 30, 32, 53, 56, 67, 71, 81, 102, 104, 
111, 112, 114, 119, 122, 124, 125, 134, 144, 146, 189, 190, 
214, 215, 219, 220, 235, 237, 246, 280, 288, 302, 310, 313, 
319, 322, 343, 353, 354 
<223> n = A,T,C or G 



<400> 73 

ctggggancc ggcggtnngc 
aaccgcncaa naaacatgcc 
gnanngagga acanaacaaa 
ttggccacnn gtggaattaa 
ataagngacc ctttatttca 
tnccacgtan agntggaant 
ttgttcaaaa aaaaaataa 

<210> 74 

<211> 437 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 145, 355 
<223> n = A, T,C or G 



nccatntcnn gncgcgaagg 
naagatatgg acgaggaaga 
ctcnangagc tctcaagcta 
gaaatctggc aaanngtann 
tctgtattta aacctctctn 
anttgttgtc ttggactgtt 



tggcaataaa aanccnctga 60 
tngngctttc nngnacaanc 120 
atgccgcggg gaaggggccc 180 
tgttccttgt gcctnangag 240 
ttccctgnca taacttcttt 300 
gtncatttta gannaaactt 360 

379 



<400> 74 

actagttcag actgccacgc caaccccaga 
ctaggtgttt ccatctatgt ttcaatctgt 



aaatacccca catgccagaa aagtgaagtc 60 
ccatctacca ggcctcgcga taaaaacaaa 120 



WO 02/47534 



31 



PCT/US01/47576 



acaaaaaaac 
caccagggtt 
aatcactgaa 
gaataagtta 
gtcatttgta 
aaaaaaaaaa 



gctgccaggt 
cttttgaaat 
ttgtcaggct 
taatcagtat 
ctgtttgaaa 
aaaaaaa 



tttanaagca 
agtaccacat 
ttgattgata 
tcatctcttt 
aatatttctt 



gttctggtct 
gtaaaaggga 
attgtagaaa 
gttttttgtc 
ctataaaatt 



caaaaccatc 
atttggcttt 
taagtagcct 
actcttttct 
aaactaacct 



aggatcctgc 180 
cacttcatct 240 
tctgttgtgg 300 
ctctnattgt 360 
gccttaaaaa 420 
437 



<210> 75 

<2U> 579 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 440, 513, 539, 551 
<223> n - A,T,C or G 



<400> 75 

ctccgtcgcc 

gacccagcac 

ccctgtgttt 

caaggtgcac 

tgaaaacaag 

gacctatttc 

cctccgtcta 

ccttggggtg 

gtgttgattc 

gatatatttt 



gccaagatga 
atcgccgacc 
aaggccgtgt 
gtcggcgacg 
cccttgacct 
tgatcctgac 
ccagagcgtg 
gaaggggcan 
tttccttcca 
naaaatcctt 



tgtgcggggc 
aggtgaggtc 
cattcaagag 
aggacttcgt 
tatctaacta 
tttggacaag 
cacttgtgat 
gatctgcact 
ataggtgatc 
aaaaaaaaaa 



gccctccgcc 
ccagcttgaa 
ccaggtggtc 
acacctgcga 
ccagaccaac 
gcccttcagc 
cctaaaataa 
gcttttgcat 
ttnattactt 
aaaaaaaaa 



acgcagccgg 
gagaaagaaa 
gcggggacaa 
gtgttccaat 
aaagccaagc 
cagaagactg 
gcttcatctc 
ttctcttcct 
tcagaatatt 



ccaccgccga 60 
acaagaagtt 120 
actacttcat 180 
ctctccctca 240 
atgatgagct 300 
acaaagtcat 360 
cgggctgtgc 420 
aaatttcatt 480 
ttccaaatna 54 0 
579 



<210> 76 

<211>* 666 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 411, 470, 476, 491, 506, 527, 560, 570, 632, 636, 643, 650, 
654, 658 

<223> n - A,T,C or G 



<400> 76 

gtttatccta 

tccctgtttc 

ttgatgttgt 

ttcctggcta 

ctcactacag 

cagcttctcc 

taaaaaatat 

cagccagtga 

ttctcaataa 

tcattttagg 

atatcaatta 

cttaaa 



tctctccaac 
ttccacagtg 
tatgggcagg 
ctccatgttg 
ggaccaggga 
aacaataaaa 
acagtttacc 
acaacctttt 
ncctcacttt 
caaatatgan 
ccacccccat 



cagattgtca 
cctaataata 
atggcaacca 
gctagcctct 
tgatgcaaca 
agcacgtggt 
gaaaatcata 
cccaccatac 
cttaanatct 
ttttattgtn 
ctcccatgaa 



gctccttgag 
ctgtggaact 
gaccattgtc 
ggtaacctct 
tccttgtctt 
aaaacacttg 
ttatcttaca 
aaaaattcct 
tacaagatag 
cgttacttgt 
anaaanggga 



ggcaagagcc 
aggttttaat 
tcagagcagg 
tacttattat 
tttatgacag 
cggatattct 
atgaaaagga 
tttcccgaan 
ccccganatc 
ttcaaaattt 
aanggtgaan 



acagtatatt 60 
aattttttaa 120 
tgctggctct 180 
cttcaggaca 24 0 
gatgtttgct 300 
ggac.tgtttt 360 
ntttatagat 420 
gaaaanggct 480 
ttatcgaaac 54 0 
ggtattgtga 600 
ttcntaancg 660 
666 



<210> 77 

<211> 396 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> mis cofeature 

<222> 31, 54, 125, 128, 136, 163, 168, 198 
<223> n = A,T,C or G 



<400> 77 

ctgcagcccg ggggatccac taatctacca nggttatttg gcagctaatt ctanatttgg 60 

atcattgccc aaagttgcac ttgctggtct cttgggattt ggccttggaa aggtatcata 120 

catanganta tgccanaata aattccattt ttttgaaaat canctccntg gggctggttt 180 

tggtccacag cataacangc actgcctcct tacctgtgag gaatgcaaaa taaagcatgg 240 

attaagtgag aagggagact ctcagccttc agcttcctaa attctgtgtc tgtgactttc, 300 

gaagtttttt aaacctctga atttgtacac atttaaaatt tcaagtgtac tttaaaataa 360 

aatacttcta atgggaacaa aaaaaaaaaa aaaaaa 396 

<210> 78 

<211> 793 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 309, 492, 563, 657, 660, 703, 708, 710, 711, 732, 740, 748, 
758, 762, 765, 787 
<223> n = A,T,C or G 



<400> 78 

gcatcctagc 

gaaaattcca 

taccacagtc 

gaccctctcc 

atataaatcc 

acacagtcna 

gcagtttgtc 

ccagtatgtc 

ggaagatatt 

atgaaaaagc 

tctgtcggct 

gacacctgat 

ttggttcaat 

aataatnttt 



cgccgactca 
gtgtcagcat 
aaacctggag 
agaggttggg 
aagacaagca 
gctttaaaga 
ctcctcaatc 
ccaggattat 
cnaaccgtct 
tctcaagttg 
tgaaaattga 
taggttttgg 
tntctttttn 
ggc 



cacaaggcag 
tcttgctcct 
ccaaaaagga 
gtgaccaact 
acaaaccctt 
aagtgtttgc 
tggtttatga 
gtttgttgac 
ctatgcttac 
ctnaaaatga 
aaccagaaaa 
ttatgttcac 
aaacaatntg 



gtgggtgagg 
tgtggccctc 
cacaaaggac 
catctggact 
gatgattatt 
tgaaaataaa 
aacaactgac 
ccatctctga 
aaactgcaga 
attgtaagaa 
atgtgaaaaa 
cactattttt 
tttctacntt 



aaatccagag 
tcctacactc 
tctcgaccca 
cagacatatg 
catcacttgg 
gaaatccaga 
aaacaccttt 
cagttgaagc 
tacgctctgt 
aaaaaatctc 
tggctattgt 
aanaaaanan 
gnganctgat 



ttgccatgga 60 
tggccagaga 120 
aactgcccca 180 
aagaagctct 240 
atgagtgccc 300 
aattggcaga 360 
ctcctgatgg 420 
cgatatcctg 480 
tgcttgacac 540 
cagccttctg 600 
ggaacanatn 660 
nttttaaaat 720 
ttctaaaaaa 780 
793 



<210> 79 

<211> 456 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 89, 195, 255, 263, 266, 286, 353, 384, 423, 
<223> n m A,T,C or G 



425, 436, 441 



<400> 79 

actagtatgg ggtgggaggc cccacccttc 
ctccgtggag agggactggc agagctgang 
gcagctgttg agcgcaccta accactggtc 
tcctcccgac cccangacca ggctacttct 
tgcctctgat cgtangaatt gangantgtc 



tcccctaggc gctgttcttg ctccaaaggg 60 
ccacctgggg ctggggatcc cactcttctt 120 
atgcccccac ccctgctctc cgcacccgct 180 
cccctcctct tgcctccctc ctgcccctgc 24 0 
ccgccttgtg gctganaatg gacagtggca 300 
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ggggctggaa atgggtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gcnccccccc 360 
tgcaagaccg agattgaggg aaancatgtc tgctgggtgt gaccatgttt cctctccata 420 
aantncccct gtgacnctca naaaaaaaaa aaaaaa ,456 

<210> 80 

<211> 284 

<212> DNA 

<213> Homo sapiens 

<22 0> 

<221> misc_feature 
<222> 283 

<223> n « A,T,C or G 
<400> 80 

ctttgtacct ctagaaaaga taggtattgt gtcatgaaac ttgagtttaa attttatata 60 
taaaactaaa agtaatgctc actttagcaa cacatactaa aattggaacc atactgagaa 120 
gaatagcatg acctccgtgc aaacaggaca agcaaatttg tgatgtgttg attaaaaaga 180 
aataaataaa tgtgtatatg tgtaacttgt atgtttatgt ggaatacaga ttgggaaata 240 
aaatgtattt cttactgtga aaaaaaaaaa aaaaaaaaaa aana 284 

<210> 81 

<211> 671 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 388, 505, 600, 603, 615, 642, 644, 660 

<223> n - A,T,C or G 

<400> 81 

gccaccaaca ttccaagcta ccctgggtac ctttgtgcag tagaagctag tgagcatgtg 60 

agcaagcggt gtgcacacgg agactcatcg ttataattta ctatctgcca agagtagaaa 120 

gaaaggctgg ggatatttgg gttggcttgg ttttgatttt ttgcttgttt gtttgttttg 180 

tactaaaaca gtattatctt ttgaatatcg tagggacata agtatataca tgttatccaa 240 

tcaagatggc tagaatggtg cctttctgag tgtctaaaac ttgacacccc tggtaaatct 300 

ttcaacacac ttccactgcc tgcgtaatga agttttgatt catttttaac cactggaatt 360 

tttcaatgcc gtcattttca gttagatnat tttgcacttt gagattaaaa tgccatgtct 420 

atttgattag tcttattttt ttatttttac aggcttatca gtctcactgt tggctgtcat 480 

tgtgacaaag tcaaataaac ccccnaggac aacacacagt atgggatcac atattgtttg 540 

acattaagct ttggccaaaa aatgttgcat gtgttttacc tcgacttgct aaatcaatan 600 

canaaaggct ggctnataat gttggtggtg aaataattaa tnantaacca aaaaaaaaan 660 

aaaaaaaaaa a 671 

<210> 82 

<211> 217 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 35 

<223> n « A,T,C or G 
<400> 82 

ctgcagatgt ttcttgaatg ctttgtcaaa ttaanaaagt taaagtgcaa taatgtttga 60 
agacaataag tggtggtgta tcttgtttct aataagataa acttttttgt ctttgcttta 120 
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tcttattagg gagttgtatg tcagtgtata aaacatactg tgtggtataa caggcttaat 180 
aaattcttta aaaggaaaaa aaaaaaaaaa aaaaaaa 217 

<210> 83 

<211> 460 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<222> 104, 118, 172, 401, 422, 423, 
<223> n = A,T,C or G 



444, 449 



<400> 83 

cgcgagtggg 

aatggcagac 

aacggagacg 

gagtgaaatt 

cgtgatgtgg 

ctgggcactc 

gactgccaaa 

annataaaac 



agcaccagga 
aaaccagaca 
caggagaaga 
tcctaagatc 
aggaagagcc 
cgcgccgatg 
ttctccggtt 
acacctcgtg 



tctcgggctc 
tgggggaaat 
acaccctgcc 
ctggaggatt 
acctgcaaga 
ccaccggcct 
tgccccggga 
gcancaaana 



ggaacgagac 
cgccagcttc 
gaccaaagag 
tcctaccccc 
tggacacgag 
gtgggtctct 
tattatacaa 
aaaaaaaaaa 



tgcacggatt 
gatnaggcca 
accattgagc 
gtcctcttcg 
ccacaagctg 
gaagggaccc 
nattatttgt 



gttttaagaa 60 
agctgaanaa 120 
angagaagcg 180^ 
agaccccagt 240 
cactgtgaac 300 
cccccaatcg 360 
atgaataatg 420 
460 



<210> 84 

<211> 323 

<212> DNA 

<213> Homo sapiens 

<220>" 

<221> misc_feature 

<222> 70, 138, 178, 197, 228, 242, 244, 287, 311 
<223> n = A,T,C or G 



<400> 84 

tggtggatct 

gtggtccaan 

aattgaagtt 

gtccctgcgc 

cnancatctc 

atttcctgta 



tggctctgtg 
gcattttgct 
tacccganat 
aacaacnaac 
tctagctgac 
naaaaaaaaa 



gagctgctgg 
ggcttaacgg 
aacaatcttt 
aatctctggg 
cgatcatatc 
aaa 



gacgggatct 
gtcccggaac 
tgggcagaga 
aaataccggc 
gtcccagatt 



aaaagactat 
aaaggacacc 
tgcctatttt 
catgaacntg 
actacanatc 



tctggaagct 60 
agctctctaa 120 
aacaaacncc 180 
ctgtctcaat 240 
ataataattg 300 
323 



<210> 85 

<211> 771 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 63, 426, 471, 497, 521, 554, 583, 586, 606, 609, 615, 652, 
686, 691, 694, 695, 706, 713, 730, 732, 743, 751 
<223> n - A, T,C or G 



<400> 85 

aaactgggta 

aanagtttgc 

gaagcaagca 

attgtgggct 

cacacaaaga 



ctcaacactg 
tcctggctgc 
actttgactg 
tcacacggca 
aaaagttgtc 



agcagatctg 
tttgatgtca 
ctgtcttgga 
gctggccaat 
tgtgtgcgca 



ttctttgagc 
gtgctgctac 
tacacagacc 
gaaggctgtg 
aatccaaaac 



taaaaaccat 
tccacctctg 
gtattcttca 
acatcaatgc 
agacttgggt 



gtgctgtacc 60 
cggcgaatca 120 
tcctaaattt 180 
tatcatcttt 240 
gaaatatatt 300 
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gtgcgtctcc 
attggacata 
atgganggtt 
atcatattgc 
gttatttata 
ttggtntant 
gccacaagct 
tttgcttttn 



tcagtaaaaa 
gcccaagaac 
tagtgcttat 
atcatanttt 
gctntaggtt 
gcaanttaaa 
ttttttaaaa 
antagaaaat 



agtcaagaac 
agaaagaact 
cttatttgtg 
gctttgttta 
ttctgtgttt 
aattatattt 
aaccantaca 
ttnttagaac 



atgtaaaaac 
tgctggggtt 
cctcctggac 
acatcacatt 
aactttttat 
ggggggggaa 
nccnngttaa 
natttgaaaa 



tgtggctttt 
ggaggtttca 
ttgtccaatt 
naaattaaac 
acnaantttc 
taaatattgg 
atggtnggtc 
aaaaaaaaaa 



ctggaatgga 360 
cttgcacatc 420 
natgaagtta 48 0 
tgtattttat 540 
ctaaactatt 600 
antttctgca 660 
ccnaatggtt 720 
a 771 



<210> 86 

<211> 628 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 
<222> 162, 249, 266, 348, 
598, 611, 617, 621, 624 
<223> n s A,T,C or G 



407, 427, 488, 518, 545, 566, 569, 597, 



<400> 86 

actagtttgc 

cttgtgttag 

attatcttaa 

agttcataca 

gtggagaang 

aatctggggt 

gaaatattaa 

ttccctnggg 

tcctttcnca 

catcntctgc 

ccaaggaatt 



tttacatttt 
gtaagaatgg 
agctgaagcc 
ttcaaagcat 
aaatagatta 
tgaaattttc 
tgtttacctt 
gatggggaat 
gtttctggct 
tttattccca 
nagtggnttc 



tgaaaagtat 
aatttattaa 
aaaatatgct 
ctgaactgta 
atgtcnaagt 
tagttttcat 
tcaatgtgtg 
ggattattgg 
cctaccctac 
ttaatnaant 
ntcnttgt 



tatttttgtc 
gtgaatcagt 
tcaaaagaaa 
gtttctatag 
atgattggtg 
tctgtacatt 
gtatcagctg 
aaaatggaaa 
tgatttancc 
tttgatgaat 



caagtgctta 
gtgacccttc 
angactttat 
caagccaatt 
gagggagcaa 
tttagttnga 
gactcantaa 
gaaaaaagta 
agaataagaa 
aaatctgctt 



tcaactaaac 60 
ttgtcataag 120 
tgttcattgt 180 
acatccataa 240 
ggttgaagat 300 
catcagattt 360 
cacccctttc 420 
cttaaagcct 480 
aacattttat 54 0 
ttatgcnnac 600 
628 



<210> 87 

<211> 518 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> miscJEeature 
<222> 384, 421, 486 
<223> n = A,T,C or G 

<400> 87 

ttttttattt tttttagaga gtagttcagc 
tataacaaca ttatactgtt tatggtttaa 
agtagtacag ttttaaaatt ttatgcttaa 
ttttacatgg caaatcaatt tttaagtcat 
aaacacattt aatttcaatt tctctcttat 
ctacagttta acaatgcagc aaaattccca 
ggttaaaatg ctttgaggat cctnaatacc 
naatttaacc ctcatgccat aagcagaagc 
taaaancgag ccccccgttg aaaaagcaaa 

<210> 88 

<211> 1844 

<212> DNA 

<213> Homo sapiens 



ttttatttat aaatttattg cctgttttat 60 
tacatatggt tcaaaatgta taatacatca 120 
aacaagtttt gtgtaaaaaa tgcagataca 180 
cctaaaaatt gatttttttt tgaaatttaa 240 
ataaccttta ttactatagc atggtttcca 300 
tttcacggta aattgggttt taagcggcaa 360 
ctttgaactt caaatgaagg ttatggttgt 420 
acaagtttag ctgcattttg ctctaaactg 480 
agggaccc 518 



WO 02/47534 



36 



PCT/US01/47576 



<400>'88 

gagacagtga atcctagtat caaaggattt ttggcctcag aaaaagttgt tgattatttt 60 
tattttattt tatttttcga gactccgtct caaaaaaaaa aaaaaaaaaa agaatcacaa 120 
ggtatttgct aaagcatttt gagctgcttg gaaaaaggga agtagttgca gtagagtttc 180 
ttccatcttc ttggtgctgg gaagccatat atgtgtcttt tactcaagct aaggggtata 240 
agcttatgtg ttgaatttgc tacatctata tttcacatat tctcacaata agagaatttt 300 
gaaatagaaa tatcatagaa catttaagaa agtttagtat aaataatatt ttgtgtgttt 360 
taatcccttt gaagggatct atccaaagaa aatattttac actgagctcc ttcctacacg 420 
tctcagtaac agatcctgtg ttagtctttg aaaatagctc attttttaaa tgtcagtgag 480 
tagatgtagc atacatatga tgtataatga cgtgtattat gttaacaatg tctgcagatt 540 
ttgtaggaat acaaaacatg gcctttttta taagcaaaac gggccaatga ctagaataac 600 
acatagggca atctgtgaat atgtattata agcagcattc cagaaaagta gttggtgaaa 660 
taattttcaa gtcaaaaagg gatatggaaa gggaattatg agtaacctct attttttaag 720 
ccttgctttt aaattaaacg ctacagccat ttaagccttg aggataataa agcttgagag 780 
taataatgtt aggttagcaa aggtttagat gtatcacttc atgcatgcta ccatgatagt 840 
aatgcagctc ttcgagtcat ttctggtcat tcaagatatt cacccttttg cccatagaaa 900 
gcaccctacc tcacctgctt actgacattg tcttagctga tcacaagatc attatcagcc 960 
tccattattc cttactgtat ataaaataca gagttttata ttttcctttc ttcgtttttc 1020 
accatattca aaacctaaat ttgtttttgc agatggaatg caaagtaatc aagtgttcgt 1080 
gctttcacct agaagggtgt ggtcctgaag gaaagaggtc cctaaatatc ccccaccctg 1140 
ggtgctcctc cttccctggt accctgacta ccagaagtca ggtgctagag cagctggaga 1200 
agtgcagcag cctgtgcttc cacagatggg ggtgctgctg caacaaggct ttcaatgtgc 1260 
ccatcttagg gggagaagct agatcctgtg cagcagcctg gtaagtcctg aggaggttcc 1320 
attgctcttc ctgctgctgt cctttgcttc tcaacggggc tcgctctaca gtctagagca 1380 
catgcagcta acttgtgcct ctgcttatgc atgagggtta aattaacaac cataaccttc 14 40 
atttgaagtt caaaggtgta ttcaggatcc tcaaagcatt ttaaccttgc cgcttaaaac 1500 
ccaatttacc gtgaaatggg aattttgctg cattgttaaa ctgtagtgga aaccatgcta 1560 
tagtaataaa ggttatataa gagagaaatt gaaattaaat gtgtttttaa atttcaaaaa 1620 
aaaatcaatc tttaggatga cttaaaaatt gatttgccat gtaaaatgta tctgcatttt 1680 
ttacacaaaa cttgttttaa gcataaaatt ttaaaactgt actacttgat gtattataca 1740 
ttttgaacca tatgtattaa accataaaca gtataatgtt gttataataa aacaggcaat 1800 
aaatttataa ataaaagctg aaaaaaaaaa aaaaaaaaaa aaaa 1844 

<210> 89 

<211> 523 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 288, 352, 369, 398, 475, 511, 513 
<223> n = A,T,C or G 

<400> 89 

tttttttttt tttttttagt caatccacat ttattgatca cttattatgt accaggcact 60 
gggataaaga tgactgttag tcactcarcag taaggaagaa aactagcaaa taagacgatt 120 
acaatatgat gtagaaaatg ctaagccaga gatatagaaa ggtcctattg ggtccttctg 180 
tcaccttgtc tttccacatc cctacccttc acaggccttc cctccagctt cctgcccccg 240 
ctccccactg cagatcccct gggattttgc ctagagctaa acgagganat gggccccctg 300 
gccctggcat gacttgaacc caaccacaga ctgggaaagg gagcctttcg anagtggatc 360 
actttgatna gaaaacacat agggaattga agagaaantc cccaaatggc cacccgtgct 420 
ggtgctcaag aaaagtttgc agaatggata aatgaaggat caagggaatt aatanatgaa 480 
taattgaatg gtggctcaat aagaatgact ncnttgaatg acc 523 

<210> 90 
<211> 604 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 563 

<223> n = A,T,C or G 
<400> 90 

ccagtgtggt ggaatgcaaa gattaccccg gaagctttcg agaagctggg attccctgca 60 
gcaaaggaaa tagccaatat gtgtcgtttc tatgaaatga agccagaccg agatgtcaat 120 
ctcacccacc aactaaatcc caaagtcaaa agcttcagcc agtttatctc agagaaccag 180 
gggagccttc aagggcatgt agaaaatcag ctgttcagat aggcctctgc accacacagc 24 0 
ctctttcctc tctgatcctt ttcctcttta cggcacaaca ttcatgtttg acagaacatg 300 
ctggaatgca attgtttgca acaccgaagg atttcctgcg gtcgcctctt cagtaggaag 360 
cactgcattg gtgataggac acggtaattt gattcacatt taacttgcta gttagtgata 420 
aggggtggta cacctgtttg gtaaaatgag aagcctcgga aacttgggag cttctctcct 480 
accactaatg gggagggcag attattactg ggatttctcc tggggtgaat taatttcaag 540 
ccctaattgc tgaaattccc ctnggcaggc tccagttttc tcaactgcat tgcaaaattc 600 
cccc 604 

<210> 91 
<211> 858 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 570, 591, 655, 664, 667, 683, 711, 759, 760, 765, 777, 787, 
792, 794, 801, 804, 809, 817, 820 
<223> n = A,T,C or G 



<400> 91 

tttttttttt ttttttttta tgattattat tttttttatt gatctttaca tcctcagtgt 60 
tggcagagtt tctgatgctt aataaacatt tgttctgatc agataagtgg aaaaaattgt 120 
catttcctta ttcaagccat gcttttctgt gatattctga tcctagttga acatacagaa 180 
ataaatgtct aaaacagcac ctcgattctc gtctataaca ggactaagtt cactgtgatc 240 
ttaaataagc ttggctaaaa tgggacatga gtggaggtag tcacacttca gcgaagaaag 300 
agaatctcct gtataatctc accaggagat tcaacgaatt ccaccacact ggactagtgg 360 
atcccccggg ctgcaggaat tcgatatcaa gcttatcgat accgtcgacc tcgagggggg 420 
gcccggtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 4 80 
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 540 
cccctttcgc cagctggcgt aatagcgaan agcccgcacc gatcgccctt ncaacagttg 600 
cgcagcctga atggcgaatg ggacgcgccc tgtagcggcg cattaaagcg cggcngggtg 660 
tggnggntcc cccacgtgac cgntacactt ggcagcgcct tacgccggtc nttcgctttc 720 
ttcccttcct ttctcgcacc gttcgccggg tttccccgnn agctnttaat cgggggnctc 780 
cctttanggg tncnaattaa nggnttacng* gaccttngan cccaaaaact ttgattaggg 840 
ggaaggtccc cgaagggg 858 

<210> 92 

<211> 585 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 317, 319, 320, 321, 325, 327, 328, 330, 331, 332, 460, 462, 
483, 485, 487, 523, 538, 566, 584 
<223> n ■ A,T,C or G 
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<400> 92 

gttgaatctc ctggtgagat tatacaggag attctctttc ttcgctgaag tgtgactacc 60 

tccactcatg tcccatttta gccaagctta tttaagatca cagtgaactt agtcctgtta 120 

tagacgagaa tcgaggtgct gttttagaca tttatttctg tatgttcaac taggatcaga 180 

atatcacaga aaagcatggc ttgaataagg aaatgacaat tttttccact tatctgatca 240 

gaacaaatgt ttattaagca tcagaaactc tgccaacact gaggatgtaa agatcaataa 300 

aaaaaataat aatcatnann naaanannan nngaagggcg gccgccaccg cggtggagct 360 

ccagcttttg ttccctttag tgagggttaa ttgcgcgctt ggcgttaatc atggtcatag 420 

ctgtttcctg tgtgaaattg ttatccggct cacaattccn cncaacatac gagccgggaa 480 

gcntnangtg taaaagcctg ggggtgccta attgagtgag ctnactcaca ttaattgngt 540 

tgcgctccac ttgcccgctt ttccantccg ggaaacctgt tcgnc 585 

<210> 93 

<211> 567 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 82, 158, 230, 232, 253, 266, 267, 268, 269, 270, 271, 272, 

273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 

285, 286, 287, 295, 303, 307, 314, 349, 352, 354, 356, 366, 

369, 379, 382, 386, 393, 404, 427, 428, 446, 450, 452 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 453, 454, 459, 462, 480, 481, 483, 488, 493, 501, 509, 511, 
512, 518, 520, 525, 526, 532, 541, 557 
<223> n = A,T,C or 6 

<400> 93 

cggcagtgtt gctgtctgcg tgtccacctt ggaatctggc tgaactggct gggaggacca 60 
agactgcggc tggggtgggc anggaaggga accgggggct gctgtgaagg atcttggaac 120 
ttccctgtac ccaccttccc cttgcttcat gtttgtanag gaaccttgtg ccggccaagc 180 
ccagtttcct tgtgtgatac actaatgtat ttgctttttt tgggaaatan anaaaaatca 240 
attaaattgc tantgtttct ttgaannnnn nnnnnnnnnn nnnnnnnggg ggggncgccc 300 
ccncggngga aacnccccct tttgttccct ttaattgaaa ggttaattng cncncntggc 360 
gttaanccnt gggccaaanc tngttncccg tgntgaaatt gttnatcccc tcccaaattc 420 
ccccccnncc ttccaaaccc ggaaancctn annntgttna ancccggggg gttgcctaan 480 
ngnaattnaa ccnaaccccc ntttaaatng nntttgcncn ccacnngccc cnctttccca 54 0 
nttcggggaa aaccctntcc gtgccca 567 

<210> 94 

<211> 620 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 169, 171, 222, 472, 528, 559, 599 
<223> n = A,T,C or G 



<400> 94 

actagtcaaa aatgctaaaa taatttggga 
catgtttatc ttttattatg ttttgtgaag 
gccaatattt ccttatatct atccataaca 
gtgaaactta acactttata aggtaaaaat 



gaaaatattt tttaagtagt gttatagttt 60 
ttgtgtcttt tcactaatta cctatactat 120 
tttatactac atttgtaana naatatgcac 180 
gaggtttcca anatttaata atctgatcaa 24 0 
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gttcttgtta 
ataaggttaa 
tttcaagcct 
gagaatttct 
atgtctctaa 
tttcccttaa 
agggttaagg 



tttccaaata 
aagttgttaa 
tcgaactatt 
cattaatatc 
gaaagtacta- 
gtgtgaaant 
gtgttgggga 



gaatggactt 
tgaccaaaca 
taaggaaagc 
ctgaatcatt 
tttcatggtc 
atttaaaatg 



ggtctgttaa 
ttctaaaaga 
aaaatcattt 
catttcacta 
caaacctggt 
aaattttcct 



gggctaagga 
aatgcaaaaa 
cctaaatgca 
aggctcatgt 
tgccatantt 
ctttttaaaa 



gaagaggaag 300 
aaaagtttat 360 
tatcatttgt 420 
tnactccgat 480 
gggtaaaggc 54 0 
attctttana 600 
620 



<210> 95 

<211> 470 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 61, 67, 79, 89, 106, 213, 271, 281, 330, 354, 387, 432, 448 
<223> n = A,T,C or G 



<400> 95 

ctcgaccttc 

nactttntgc 

gaaacatgag 

agcaggtgaa 

agccatgcca 

ccaaggtccc 

gagccaggat 

ccaagggcct 



tctgcacagc 
ttaattcang 
ttcttaccag 
acaacccatc 
ctcaaaggtt 
tgagccaggg 
gtaccaaggt 
gngccaggca 



ggatgaaccc 
agcttacang 
cagaagcaga 
cagcctccac 
ccacaacctg 
ctgtaccaan 
ccctgancca 
gcatcaangt 



tgagcagctg 
attcttcaaa 
cctttacccc 
ctnaggaaat 
naaacacaaa 
gtccctgagc 
ggttgtccaa 
ccctgaccaa 



aagaccagaa 
gagtgngtcc 
accacctcag 
atttgttccc 
nattccagag 
caggttgtac 
ggtccctgag 
ggcttatcaa 



aagccactat 60 
agcatccttt 120 
cttcaacagc 180 
acaaccaagg 240 
ccaggctgta 300 
caangtccct 360 
ccaggctaca 420 
470 



<210> 96 

<211> 660 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 299, 311, 360, 426, 538, 540, 542, 553, 563, 565, 592, 603, 
604, 618, 633, 647, 649, 651, 653 
<223> n = A,T,C or G 



<400> 96 

tttttttttt 

gcatttcttt 

tgaagacttt 

gctttatagt 

tgtactgatt 

cagcatctgg 

cttctgctga 

gcctgncaca 

aaacttgatg 

ancctgggct 

gcnnagggac 



tttttttttt 
tcattcgaat 
ctgcttaatt 
acgtattttt 
acaaggtcta 
nggttggctt 
gctgggcctg 
ggaactttgg 
aagccttggt 
canggacctt 
ccttgggncc 



ggaattaaaa 
cttcagatga 
caggggctta 
aggatacaaa 
cagacaatta 
ctcaagggct 
gagtgaccgt 
tgtatccttg 
caagggacct 
tgncncaacc 
aaccctgggc 



gcaatttaat 
accctgagca 
caggattctt 
taagagagag 
agacacagaa 
tgtctgtgca 
tgaaggacat 
ctcaggaact 
tgatgcttgc 
ttggcttcaa 
ttnagggacc 



gagggcagag 
gccgaagacc 
cagagtgtgt 
actatggctt 
acagatggga 
ccaaattact 
ggctctggta 
ttgatggcac 
tggctcaggg 
gggacccttg 
ctttggntnc 



caggaaacat 6"0 
agaaaagcca 120 
gtgaacaaaa 180 
ggggtgagaa 240 
agagggtgnc 300 
tctgcttggn 360 
cctttgtgta 420 
ctggctcagg 4 80 
accttggngn 540 
gnacatcctg 600 
nanccttggc 660 



<210> 97 

<211> 441 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<222> 12, 308 

<223> n = A,T,C or G 



<400> 97 

gggaccatac anagtattcc tctcttcaca ccaggaccag ccactgttgc agcatgagtt 60 
cccagcagca gaagcagccc tgcatcccac cccctcagct tcagcagcag caggtgaaac 120 
agccttgcca gcctccacct caggaaccat gcatccccaa aaccaaggag ccctgccacc 180 
ccaaggtgcc tgagccctgc caccccaaag tgcctgagcc ctgccagccc aaggttccag 240 
agccatgcca ccccaaggtg cctgagccct gcccttcaat agtcactcca gcaccagccc 300 
agcagaanac caagcagaag taatgtggtc cacagccatg cccttgagga gccggccacc 360 
agatgctgaa tcccctatcc cattctgtgt atgagtccca tttgccttgc aattagcatt 420 
ctgtctcccc caaaaaaaaa a 441 

<210> 98 
<211> 600 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 295, 349, 489, 496, 583 

<223> n m A,T,C or G 



<400> 98 

gtattcctct cttcacacca 
gcagccctgc atcccacccc 
tccacctcag gaaccatgca 
gccctgccac cccaaagtgc 
caaggtgcct gagccctgcc 
gcagaagtaa tgtggtccac 
cctatcccat tctgtgtatg 
aaaagaatgt gctatgaagc 
ggtcttaant acaganctag 
tgaaaggcaa atgattcagc 



ggaccagcca ctgttgcagc 
ctcagcttca gcagcagcag 
tccccaaaac caaggagccc 
ctgagccctg ccagcccaag 
cttcaatagt cactccagca 
agccatgccc ttgaggagcc 
agtcccattt gccttgcaat 
tttctttcct acacactctg 
ttttcagctg ctcagaattc 
tccttattac cccattaaat 



atgagttccc agcagcagaa 60 
gtgaaacagc cttgccagcc 120 
tgccacccca aggtgcctga 180 
gttccagagc catgccaccc 240 
ccagcccagc agaanaccaa 300 
ggccaccana tgctgaatcc 360 
tagcattctg tctcccccaa 420 
agtctctgaa tgaagctgaa 480 
tctgaagaaa agatttaaga 54 0 
tcnctttcaa ttccaaaaaa 600 



<210> 99 
<211> 667 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 345, 562, 635 
<223> n = A,T,C or G 



<400> 99 

actagtgact gagttcctgg caaagaaatt 
accatttaaa aaaatcagtg aaggatttga 
ggtcctgacg ttttgagatc caaagtggca 
tttctcttgt gagagttccc tcatctgaaa 
agtagaagat ttgttgaaga catagaaccc 
ttaaagtctt gtgagcacct gggaattagt 
attttgtaag gctataattg tatcttttaa 
tggagatttt taagagtttt aaccagctgc 
gtataaagat atagtaaatg catctcctag 



tgacctggac cagttgataa ctcatgtttt 60 
gctgctcaat tcaggacaaa gcattcgaac 120 
ggaggtctgt gttgtcatgg tgaactggag 180 
tcatgtatct gtctcacaaa tacaagcata 240 
ttataaagaa ttattaacct ttataaacat 300 
ataataacaa tgttnatatt tttgatttac 360 
gaaaacatac cttggatttc tatgttgaaa 420 
tgcagatata ttactcaaaa cagatatagc 480 
agtaatattc acttaacaca ttggaaacta 54 0 



WO 02/47534 



41 



PCT/US01/47576 



ttatttttta gatttgaata tnaatgttat tttttaaaca cttgttatga gttacttggg 600 
attacatttt gaaatcagtt cattccatga tgcanattac tgggattaga ttaagaaaga 660 
cggaaaa 667 

<210> 100 
<211> 583 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 404, 506, 514, 527, 528, 538, 548, 556, 568, 569 
<223> n = A,T,C or G 

<400> 100 

gttttgtttg taagatgatc acagtcatgt tacactgatc taaaggacat atatataacc 60 
ctttaaaaaa aaaatcactg cctcattctt atttcaagat gaatttctat acagactaga 120 
tgtttttctg aagatcaatt agacattttg aaaatgattt aaagtgtttt ccttaatgtt 180 
ctctgaaaac aagtttcttt tgtagtttta accaaaaaag tgcccttttt gtcactggat 240 
tctcctagca ttcatgattt ttttttcata caatgaaatt aaaattgcta aaatcatgga 300 
ctggctttct ggttggattt caggtaagat gtgtttaagg ccagagcttt tctcagtatt 360 
tgattttttt ccccaatatt tgatttttta aaaatataca catnggtgct gcatttatat 420 
ctgctggttt aaaattctgt catatttcac ttctagcctt ttagttatgg caaatcatat 4 80 
tttactttta cttaaagcat ttggtnattt ggantatctg gttctannct aaaaaaanta 540 
attctatnaa ttgaantttt ggtactcnnc catatttgga tec 583 

<210> 101 
<211> 592 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<222> 218, 497, 502, 533, 544, 546, 548, 550, 555 
<223> n = A,T,C or G 

<400> 101 

gtggagacgt acaaagagca gccgctcaag acacctggga agaaaaagaa aggcaagccc 60 

gggaaacgea aggagcagga aaagaaaaaa cggcgaactc gctctgcctg gttagactct 120 

ggagtgactg ggagtgggct agaaggggac cacctgtctg acacctccac aacgtcgctg 180 

gagctcgatt caeggaggea ttgaaatttt cagcaganac cttccaagga catattgeag 240 

gattctgtaa tagtgaacat atggaaagta ttagaaatat ttattgtctg taaatactgt 300 

aaatgcattg gaataaaact gtctccccca ttgctctatg aaactgeaca ttggtcattg 360 

tgaatatttt tttttttgee aaggctaatc caattattat tatcacattt accataattt 420 

attttgtcca ttgatgtatt tattttgtaa atgtatcttg gtgctgctga atttctatat 480 

tttttgtaca taatgcnttt anatatacct atcaagtttg ttgataaatg aencaatgaa 540 

gtgnencnan ttggnggttg aatttaatga atgectaatt ttattatccc aa 592 

<210> 102 
<211> 587 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 91, 131, 256, 263, 332, 392, 400, 403, 461, 496, 497, 499, 
510, 511, 518, 519, 539, 554, 560, 576 
<223> n = A,T,C or G 
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<400> 102 

cgtcctaagc acttagacta catcagggaa gaacacagac cacatccctg tcctcatgcg 60 
gcttatgt'tt tctggaagaa agtggagacc nagtccttgg ctttagggct ccccggctgg 120 
gggctgtgca ntccggtcag ggcgggaagg gaaatgcacc gctgcatgtg aacttacagc 18 0 
ccaggcggat gccccttccc ttagcactac ctggcctcct gcatcccctc gcctcatgtt 240 
cctcccacct tcaaanaatg aanaacccca tgggcccagc cccttgccct ggggaaccaa 300 
ggcagccttc caaaactcag gggctgaagc anactattag ggcaggggct gactttgggt 360 
gacactgccc attccctctc agggcagctc angtcacccn ggnctcttga acccagcctg 420 
ttcctttgaa aaagggcaaa actgaaaagg gcttttccta naaaaagaaa aaccagggaa 480 
ctttgccagg gcttcnntnt taccaaaacn ncttctcnng gatttttaat tccccattng 540 
gcctccactt accnggggcn atgccccaaa attaanaatt tcccatc 587 

<210> 103 
<211> 496 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 2, 17, 66, 74, 82, 119, 164, 166, 172, 200, 203, 228, 232, 
271, 273, 415, 423, 445, 446, 473 
<223> n = A,T,C or G 

<400> 103 

anaggactgg ccctacntgc tctctctcgt cctacctatc aatgcccaac atggcagaac 60 

ctgcanccct tggncactgc anatggaaac ctctcagtgt cttgacatca ccctacccnt 120 

gcggtgggtc tccaccacaa ccactttgac tctgtggtcc ctgnanggtg gnttctcctg 180 

actggcagga tggaccttan ccnacatatc cctctgttcc ctctgctnag anaaagaatt 240 

cccttaacat gatataatcc acccatgcaa ntngctactg gcccagctac catttaccat 300 

ttgcctacag aatttcattc agtctacact ttggcattct ctctggcgat agagtgtggc 360 

tgggctgacc gcaaaaggtg ccttacacac tggcccccac cctcaaccgt tgacncatca 420 

gangcttgcc tcctccttct gattnncccc catgttggat atcagggtgc tcnagggatt 480 

ggaaaagaaa caaaac 4 96 

<210> 104 
<211> 575 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 18, 19, 45, 68, 77, 132, 155, 174, 219, 226, 238, 259, 263, 
271, 273, 306, 323, 339, 363, 368, 370, 378, 381, 382, 436, 
440, 449, 450, 456, 481, 485, 496, 503, 510, 512, 515, 528, 
542, 552 

<223> n = R,T,C or G 
<400> 104 

gcacctgctc tcaatccnnc tctcaccatg atcctccgcc tgcanaaact cctctgccaa 60 
ctatggangt ggtttcnggg gtggctcttg ccaactggga agaagccgtg gtgtctctac 120 
ctgttcaact cngtttgtgt ctgggggatc aactnggggc tatggaagcg gctnaactgt 180 
tgttttggtg gaagggctgg taattggctt tgggaagtng cttatngaag ttggcctngg 240 
gaagttgcta ttgaaagtng ccntggaagt ngntttggtg gggggttttg ctggtggcct 300 
ttgttnaatt tgggtgcttt gtnaatggcg gccccctcnc ctgggcaatg aaaaaaatca 360 
ccnatgcngn aaacctcnac nnaacagcct gggcttccct cacctcgaaa aaagttgctc 420 
cccccccaaa aaaggncaan cccctcaann tggaangttg aaaaaatcct cgaatgggga 480 
ncccnaaaac aaaaancccc ccntttcccn gnaanggggg aaataccncc cccccactta 540 
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cnaaaaccct tntaaaaaac cccccgggaa aaaaa 575 

<210> 105 
<211> 619 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 260, 527, 560, 564, 566, 585, 599 
<223> n = A,T,C or G 

<400> 105 

cactagtagg atagaaacac tgtgtcccga gagtaaggag agaagctact attgattaga 60 

gcctaaccca ggttaactgc aagaagaggc gggatacttt cagctttcca tgtaactgta 120 

tgcataaagc caatgtagtc cagtttctaa gatcatgttc caagctaact gaatcccact 180 

tcaatacaca ctcatgaact cctgatggaa caataacagg cccaagcctg tggtatgatg 240 

tgcacacttg ctagactcan aaaaaatact actctcataa atgggtggga gtattttggt 300 

gacaacctac tttgcttggc tgagtgaagg aatgatattc atatattcat ttattccatg 360 

gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 420 

tttccaaatt tttgtacagt cgctgcacat atttgaaatc atatattaag acttccaaaa 480 

aatgaagtcc ctggtttttc atggcaactt gatcagtaaa ggattcncct ctgtttggta 540 

cttaaaacat ctactatatn gttnanatga aattcctttt ccccncctcc cgaaaaaana 600 

aagtggtggg gaaaaaaaa 619 

<210> 106 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 8, 21, 31, 32, 58, 75, 89, 96, 99, 103, 122, 126, 147, 150, 
158, 195, 210, 212, 219, 226, 246, 248, 249, 255, 258, 261, 
263, 265, 275, 304, 317, 321, 331, 337, 340, 358, 371, 377, 
380, 396, 450, 491 
<223> n = A,T,C or G 

<400> 106 

cattggtnct ttcatttgct ntggaagtgt nnatctctaa cagtggacaa agttcccngt 60 

gccttaaact ctgtnacact tttgggaant gaaaanttng tantatgata ggttattctg 120 

angtanagat gttctggata ccattanatn tgcccccngt gtcagaggct catattgtgt 180 

tatgtaaatg gtatntcatt cgctactatn antcaattng aaatanggtc tttgggttat 240 

gaatantnng cagcncanct nanangctgt ctgtngtatt cattgtggtc atagcacctc 300 

acancattgt aacctcnatc nagtgagaca nactagnaan ttcctagtga tggctcanga 360 

ttccaaatgg nctcatntcn aatgtttaaa agttanttaa gtgtaagaaa tacagactgg 420 

atgttccacc aactagtacc tgtaatgacn ggcctgtccc aacacatctc ccttttccat 480 
gactgtggta ncccgcatcg gaaaaa 506 

<210> 107 
<211> 452 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 
<222> 289, 317, 378 
<223> n = A,T,C or G 
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<400>' 107 

gttgagtctg tactaaacag taagatatct caatgaacca taaattcaac tttgtaaaaa 60 

tcttttgaag catagataat attgtttggt aaatgtttct tttgtttggt aaatgtttct 120 

tttaaagacc ctcctattct ataaaactct gcatgtagag gcttgtttac ctttctctct 18 0 

ctaaggttta caataggagt ggtgatttga aaaatataaa attatgagat tggttttcct 240 

gtggcataaa ttgcatcact gtatcatttt cttttttaac cggtaagant ttcagtttgt 300 

tggaaagtaa ctgtganaac ccagtttccc gtccatctcc cttagggact acccatagaa 360 

catgaaaagg tccccacnga agcaagaaga taagtctttc atggctgctg gttgcttaaa 420 

ccactttaaa accaaaaaat tccccttgga aa 452 

<210> 108 
<211> 502 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 22, 31, 126, 168, 183, 205, 219, 231, 236, 259, 283, 295, 
296, 298, 301, 340, 354, 378, 383, 409, 433, 446, 455, 466, 
488 

<223> n = A,T,C or G 



<400> 108 

atcttcttcc cttaattagt tnttatttat 
caaaaagaga ttgtagattg gcttctggct 
agaccncaac tgaagcttaa aaaatctatc 
tanagcatat aaaactttta acatntgctt 
aaaatgtccc tttaacatnc aatatcccac 
naaaaaaagg gtagaaggga tttaatgaaa 
ctccagaaca aaaacttntc aantctttca 
aaactccatt agncccactt tctaanggtc 
accctggnta ctcctgccct ca 

<210> 109 

<211> 1308 

<212> DNA 

<213> Homo sapiens 



ntattaaatt ttattgcatg tcctggcaaa 60 
ccccaaaagc ccataacaga aagtaccaca 120 
acatgtataa tacctttnga agaacattaa 180 
aatgttgtnc aattataaaa ntaatngaaa 240 
atagtgttat ttnaggggat taccnngnaa 300 
actctgcttn ccatttctgt ttanaaacgt 360 
gctaaccgca tttgagctna ggccactcaa 420 
tctanagctt actaancctt ttgacccctt 480 

502 



<400> 109 

acccgaggtc tcgctaaaat catcatggat tcacttggcg ccgtcagcac tcgacttggg 60 
tttgatcttt tcaaagagct gaagaaaaca aatgatggca acatcttctt ttcccctgtg 120 
ggcatcttga ctgcaattgg catggtcctc ctggggaccc gaggagccac cgcttcccag 180 
ttggaggagg tgtttcactc tgaaaaagag acgaagagct caagaataaa ggctgaagaa 24 0 
aaagaggtga ttgagaacac agaagcagta catcaacaat tccaaaagtt tttgactgaa 300 
ataagcaaac tcactaatga ttatgaactg aacataacca acaggctgtt tggagaaaaa 360 
acatacctct tccttcaaaa atacttagat tatgttgaaa aatattatca tgcatctctg 420 
gaacctgttg attttgtaaa tgcagccgat gaaagtcgaa agaagattaa ttcctgggtt 480 
gaaagcaaaa caaatgaaaa aatcaaggac ttgttcccag atggctctat tagtagctct 54 0 
accaagctgg tgctggtgaa catggtttat tttaaagggc aatgggacag ggagtttaag 600 
aaagaaaata ctaaggaaga gaaattttgg atgaataaga gcacaagtaa atctgtacag 660 
atgatgacac agagccattc ctttagcttc actttcctgg aggacttgca ggccaaaatt 720 
ctagggattc catataaaaa caacgaccta agcatgtttg tgcttctgcc caacgacatc 780 
gatggcctgg agaagataat agataaaata agtcctgaga aattggtaga gtggactagt 840 
ccagggcata tggaagaaag aaaggtgaat ctgcacttgc cccggtttga ggtggaggac 900 
agttacgatc tagaggcggt cctggctgcc atggggatgg gcgatgcctt cagtgagcac 960 
aaagccgact actcgggaat gtcgtcaggc tccgggttgt acgcccagaa gttcctgcac 1020 
agttcctttg tggcagtaac tgaggaaggc accgaggctg cagctgccac tggcataggc 1080 
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tttactgtca catccgcccc aggtcatgaa aatgttcact gcaatcatcc cttcctgttc 1140 

ttcatcaggc acaatgaatc caacagcatc ctcttcttcg gcagattttc ttctccttaa 1200 

gatgatcgtt gccatggcat tgctgctttt agcaaaaaac aactaccagt gttactcata 1260 

tgattatgaa aatcgtccat tcttttaaat ggtggctcac ttgcattt 1308 

<210> 110 
<211> 391 
<212> PRT 

<213> Homo sapiens 
<400> 110 

Met Asp Ser Leu Gly Ala Val Ser Thr Arg Leu Gly Phe Asp Leu Phe 

1 5 10 . 15 

Lys Glu Leu Lys Lys Thr Asn Asp Gly Asn lie Phe Phe Ser Pro Val 

20 25 30 

Gly He Leu Thr Ala He Gly Met Val Leu Leu Gly Thr Arg Gly Ala 

35 40 45 

Thr Ala Ser Gin Leu Glu Glu Val Phe His Ser Glu Lys Glu Thr Lys 

50 55 60 

Ser Ser Arg He Lys Ala Glu Glu Lys Glu Val He Glu Asn Thr Glu 
65 70 75 80 

Ala Val His Gin Gin Phe Gin Lys Phe Leu Thr Glu He Ser Lys Leu 

85 90 95 

Thr Asn Asp Tyr Glu Leu Asn He Thr Asn Arg Leu Phe Gly Glu Lys 

100 . 105 110 

Thr Tyr Leu Phe Leu Gin Lys Tyr Leu Asp Tyr Val Glu Lys Tyr Tyr 

115 120 125 

His Ala Ser Leu Glu Pro Val Asp Phe Val Asn Ala Ala Asp Glu Ser 

130 135 140 

Arg Lys Lys He Asn Ser Trp Val Glu Ser Lys Thr Asn Glu Lys He 
145 150 155 ^ 160 

Lys Asp Leu Phe Pro Asp Gly Ser He Ser Ser Ser Thr Lys Leu Val 

165 170 175 

Leu Val Asn Met Val Tyr Phe Lys Gly Gin Trp Asp Arg Glu Phe Lys 

180 185 " 190 

Lys Glu Asn Thr Lys Glu Glu Lys Phe Trp Met Asn Lys Ser Thr Ser 

195 200 205 

Lys Ser Val Gin Met Met Thr Gin Ser His Ser Phe Ser Phe Thr Phe 

210 215 220 

Leu Glu Asp Leu Gin Ala Lys He Leu Gly He Pro Tyr Lys Asn Asn 
225 230 235 " 240 

Asp Leu Ser Met Phe Val Leu Leu Pro Asn Asp He Asp Gly Leu Glu 

245 250 255 

Lys He He Asp Lys He Ser Pro Glu Lys Leu Val Glu Trp Thr Ser 

260 265 270 

Pro Gly His Met Glu Glu Arg Lys Val Asn Leu His Leu Pro Arg Phe 

275 280 285 

Glu Val Glu Asp Ser Tyr Asp Leu Glu Ala Val Leu Ala Ala Met Gly 

290 295 300 

Met Gly Asp Ala Phe Ser Glu His Lys Ala Asp Tyr Ser Gly Met Ser 
305 310 315 " 320 

Ser Gly Ser Gly Leu Tyr Ala Gin Lys Phe Leu His Ser Ser Phe Val 

325 330 335 

Ala Val Thr Glu Glu Gly Thr Glu Ala Ala Ala Ala Thr Gly He Gly 

340 345 350 

Phe Thr Val Thr Ser Ala Pro Gly His Glu Asn Val His Cys Asn His 

355 360 365 

Pro Phe Leu Phe Phe He Arg His Asn Glu Ser Asn Ser He Leu Phe 
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370 375 
Phe Gly Arg Phe Ser Ser Pro 
385 390 



380 



<210> 111 

<211> 1419 

<212> DNA 

<213> Homo sapiens 

<400> 111 

ggagaactat aaattaagga tcccagctac ttaattgact tatgcttcct agttcgttgc 60 
ccagccacca ccgtctctcc aaaaacccga ggtctcgcta aaatcatcat ggattcactt 120 
ggcgccgtca gcactcgact tgggtttgat cttttcaaag agctgaagaa aacaaatgat 180 
ggcaacatct tcttttcccc tgtgggcatc ttgactgcaa ttggcatggt cctcctgggg 240 
acccgaggag ccaccgcttc ccagttggag gaggtgtttc actctgaaaa agagacgaag 300 
agctcaagaa taaaggctga agaaaaagag gtggtaagaa taaaggctga aggaaaagag 360 
attgagaaca cagaagcagt acatcaacaa ttccaaaagt ttttgactga aataagcaaa 420 
ctcactaatg attatgaact gaacataacc aacaggctgt ttggagaaaa aacatacctc 480 
ttccttcaaa aatacttaga ttatgttgaa aaatattatc atgcatctct ggaacctgtt 540 
gattttgtaa atgcagccga tgaaagtcga aagaagatta attcctgggt tgaaagcaaa 600 
acaaatgaaa aaatcaagga cttgttccca gatggctcta ttagtagctc taccaagctg 660 
gtgctggtga acatggttta ttttaaaggg caatgggaca gggagtttaa gaaagaaaat 720 
actaaggaag agaaattttg gatgaataag agcacaagta aatctgtaca gatgatgaca 780 
cagagccatt cctttagctt cactttcctg gaggacttgc aggccaaaat tctagggatt 840 
ccatataaaa acaacgacct aagcatgttt gtgcttctgc ccaacgacat cgatggcctg 900 
gagaagataa tagataaaat aagtcctgag aaattggtag agtggactag tccagggcat 960' 
atggaagaaa gaaaggtgaa tctgcacttg ccccggtttg aggtggagga cagttacgat 1020 
ctagaggcgg tcctggctgc catggggatg ggcgatgcct tcagtgagca caaagccgac 1080 
tactcgggaa tgtcgtcagg ctccgggttg tacgcccaga agttcctgca cagttccttt 1140 
gtggcagtaa ctgaggaagg caccgaggct gcagctgcca ctggcatagg ctttactgtc 1200 
acatccgccc caggtcatga aaatgttcac tgcaatcatc ccttcctgtt cttcatcagg 1260 
cacaatgaat ccaacagcat cctcttcttc ggcagatttt cttctcctta agatgatcgt 1320 
tgccatggca ttgctgcttt tagcaaaaaa caactaccag tgttactcat atgattatga 1380 
aaatcgtcca ttcttttaaa tggtggctca cttgcattt 1419 

<210> 112 
<211> 400 
<212> PRT 
<213> Homo sapiens 



<400> 112 



Met 


Asp 


Ser 


Leu 


Gly Ala 


Val 


Ser 


Thr 


Arg 


Leu Gly Phe Asp Leu Phe 


1 








5 










10 


15 


Lys 


Glu 


Leu 


Lys 


Lys 


Thr 


Asn 


Asp 


Gly 


Asn 


He Phe Phe Ser Pro Val 








20 










25 




30 


Gly 


He 


Leu 


Thr 


Ala 


He 


Gly 


Met 


Val 


Leu 


Leu Gly Thr Arg Gly Ala 






35 










40 






45 


Thr 


Ala 


Ser 


Gin 


Leu 


Glu 


Glu 


Val 


Phe 


His 


Ser Glu Lys Glu Thr Lys 




50 










55 








60 


Ser 


Ser 


Arg 


He 


Lys 


Ala 


Glu 


Glu 


Lys 


Glu 


Val Val Arg He Lys Ala 


65 










70 










75 80 


Glu 


Gly 


Lys 


Glu 


He 


Glu 


Asn 


Thr 


Glu 


Ala 


Val His Gin Gin Phe Gin 










85 










90 


95 


Lys 


Phe 


Leu 


Thr 


Glu 


He 


Ser 


Lys 


Leu 


Thr 


Asn Asp Tyr Glu Leu Asn 








100 










105 




110 


He 


Thr 


Asn 


Arg 


Leu 


Phe 


Gly 


Glu 


Lys 


Thr 


Tyr Leu Phe Leu Gin Lys 



115 120 125 
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Tyr Leu 


Asp Tyr Val 


Glu 


Lys 


Tyr 


Tyr 


His 


Ala Ser 


Leu Glu Pro Val 




130 






135 








140 




Asp 


Phe 


Val Asn Ala 


Ala 


Asp 


Glu 


Ser 


Arg 


Lys Lys 


He Asn Ser Trp 


145 






150 










155 


160 


Val 


Glu 


Ser Lys Thr 


Asn 


Glu 


Lys 


He 


Lys 


Asp Leu 


Phe Pro Asp Gly 






165 










170 




175 


Ser 


lie 


Ser Ser Ser 


Thr 


Lys 


Leu 


Val 


Leu 


Val Asn 


Met Val Tyr Phe 






180 








185 






- 190 


Lys 


Gly 


Gin Trp Asp 


Arg 


Glu 


Phe 


Lys 


Lys 


Glu Asn 


Thr Lys Glu Glu 






1 ft c 
195 






200 








205 


Lys 


Phe 


Trp Met Asn 


Lys 


Ser 


Thr 


Ser 


Lys 


Ser Val 


Gin Met Met Thr 




210 






215 






220 




Gin 


Ser 


His Ser Phe 


Ser 


Phe 


Thr 


Phe 


Leu 


Glu Asp 


Leu Gin Ala Lys 


225 






230 










235 


240 


lie 


Leu 


Gly lie Pro 


Tyr 


Lys 


Asn 


Asn 


Asp 


Leu Ser 


Met Phe Val Leu 






245 










250 




255 


Leu 


Pro 


Asn Asp lie 


Asp 


Gly 


Leu 


Glu 


Lys 


He He 


Asp Lys He Ser 






260 








265 






270 


Pro 


Glu 


Lys Leu Val 


Glu 


Trp 


Thr 


Ser 


Pro 


Gly His 


Met Glu Glu Arg 






275 






280 






285 


Lys Val 


Asn Leu His 


Leu 


Pro 


Arg 


Phe 


Glu 


Val Glu 


Asp Ser Tyr Asp 




290 






295 








300 


Leu 


Glu 


Ala Val Leu 


Ala 


Ala 


Met 


Gly 


Met 


Gly Asp 


Ala Phe Ser Glu 


305 






310 










315 


320 


His 


Lys 


Ala Asp Tyr 


Ser 


Gly 


Met 


Ser 


Ser 


Gly Ser 


Gly Leu Tyr Ala 






325 










330 




335 


Gin Lys 


Phe Leu His 


Ser 


Ser 


Phe 


Val 


Ala 


Val Thr 


Glu Glu Gly Thr 






340 








345 






350 


Glu 


Ala 


Ala Ala Ala 


Thr 


Gly 


lie 


Gly 


Phe 


Thr Val 


Thr Ser Ala Pro 






355 






360 








365 


Gly His 


Glu Asn Val 


His 


Cys 


Asn 


His 


Pro 


Phe Leu 


Phe Phe He Arg 




370 






375 








380 


His 


Asn 


Glu Ser Asn 


Ser 


lie 


Leu 


Phe 


Phe 


Gly Arg 


Phe Ser Ser Pro 


385 






390 










395 


400 



<210> 113 
<211> 957 
<212> DNA 

<213> Homo sapiens 
<4'00> 113 

ctcgaccttc tctgcacagc ggatgaaccc tgagcagctg aagaccagaa aagccactat 60 

gactttctgc ttaattcagg agcttacagg attcttcaaa gagtgtgtcc agcatccttt 120 

gaaacatgag ttcttaccag cagaagcaga cctttacccc accacctcag cttcaacagc 180 

agcaggtgaa acaacccagc cagcctccac ctcaggaaat atttgttccc acaaccaagg 240 

agccatgcca ctcaaaggtt ccacaacctg gaaacacaaa gattccagag ccaggctgta 300 

ccaaggtccc tgagccaggc tgtaccaagg tccctgagcc aggttgtacc aaggtccctg 360 

agccaggatg taccaaggtc cctgagccag gttgtaccaa ggtccctgag ccaggctaca 420 

ccaaggtccc tgagccaggc agcatcaagg tccctgacca aggcttcatc aagtttcctg 480 

agccaggtgc catcaaagtt cctgagcaag gatacaccaa agttcctgtg ccaggctaca 540 

caaaggtacc agagccatgt ccttcaacgg tcactccagg cccagctcag cagaagacca 600 

agcagaagta atttggtgca cagacaagcc cttgagaagc caaccaccag atgctggaca 660 

ccctcttccc atctgtttct gtgtcttaat tgtctgtaga ccttgtaatc agtacattct 720 

caccccaagc catagtctct ctcttatttg tatcctaaaa atacggtact ataaagcttt 780 

tgttcacaca cactctgaag aatcctgtaa gcccctgaat taagcagaaa gtcttcatgg 840 

cttttctggt cttcggctgc tcagggttca tctgaagatt cgaatgaaaa gaaatgcatg 900 

tttcctgctc tgccctcatt aaattgcttt taattccaaa aaaaaaaaaa aaaaaaa 957 
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<210> 114 
<211> 161 
<212> PRT 
<213> Homo sapiens 



<400> 114 



Met 


Ser 


Ser Tyr 


Gin Gin Lys Gin Thr 


Phe Thr Pro 


Pro 


Pro 


Gin 


Leu 


1 






5 


10 






15 




Gin 


Gin 


Gin Gin 


Val Lys Gin Pro Ser 


Gin Pro Pro 


Pro 


Gin 


Glu 


He 






20 


25 






30 






Phe 


Val 


Pro Thr 


Thr Lys Glu Pro Cys 


His Ser Lys 


Val 


Pro 


Gin. 


Pro 






35 


40 




45 








Gly 


Asn 


Thr Lys 


lie Pro Glu Pro Gly 


Cys Thr Lys 


Val 


Pro 


Glu 


Pro 




50 




55 


60 










Gly 


Cys 


Thr Lys 


Val Pro Glu Pro Gly 


Cys Thr Lys 


Val Pro 


Glu 


Pro 


65 






70 


75 








80 


Gly 


Cys 


Thr Lys 


Val Pro Glu Pro Gly 


Cys Thr Lys 


Val 


Pro 


Glu 


Pro 








85 


90 






95 




Gly 


Tyr 


Thr Lys 


Val Pro Glu Pro Gly 


Ser lie Lys 


Val 


Pro Asp 


Gin 






100 


105 






110 






Gly 


Phe 


lie Lys 


Phe Pro Glu Pro Gly 


Ala lie Lys 


Val 


Pro 


Glu 


Gin 






115 


120 




125 








Gly 


Tyr 


Thr Lys 


Val Pro Val Pro Gly 


Tyr Thr Lys 


Val 


Pro 


Glu 


Pro 




130 




135 


140 










Cys 


Pro 


Ser Thr 


Val Thr Pro Gly Pro 


Ala Gin Gin 


Lys Thr Lys 


Gin 


145 






150 


155 








160 



<210> 115 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 8, 21, 31, 32, 58, 75, 89, 96, 99, 103, 122, 126, 147, 150, 
158, 195, 210, 212, 219, 226, 246, 248, 249, 255, 258, 261, 
263, 265, 275, 304, 317, 321, 331, 337, 340, 358, 371, 377, 
380, 396, 450, 491 
<223> n = A,T,C or G 

<400> 115 

cattggtnct ttcatttgct ntggaagtgt nnatctctaa cagtggacaa 
gccttaaact ctgtnacact tttgggaant gaaaanttng tantatgata 
angtanagat gttctggata ccattanatn tgcccccngt gtcagaggct 
tatgtaaatg gtatntcatt cgctactatn antcaattng aaatanggtc 
gaatantnng cagcncanct nanangctgt ctgtngtatt cattgtggtc 
acancattgt aacctcnatc nagtgagaca nactagnaan ttcctagtga 
ttccaaatgg nctcatntcn aatgtttaaa agttanttaa "gtgtaagaaa 
atgttccacc aactagtacc tgtaatgacn ggcctgtccc aacacatctc 
gactgtggta ncccgcatcg gaaaaa 



agttcccngt 60 
ggttattctg 120 
catattgtgt 180 
tttgggttat 240 
atagcacctc 300 
tggctcanga 360 
tacagactgg 420 
ccttttccat 480 
506 



<210> 116 
<211> 3079 
<212> DNA 
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<213> Homo sapiens 
<4Q0> 116 

ggatccccgg gtttcctaaa ccccccacag 
ggtcaaaggg cagaaaaaat gctgagttag 
aaagaggtca aagtggttta tagggggcgc 
cttgcaggca gatctgccca gtgggctctg 
tgtgcacaaa aggatgaaac tctattttcc 
cagattgcct ttcccagagg gaaaaccctg 
gcagatcact ggggaatcgt ttgccccccg 
caggtgctca gcatgtaccg tactgggatg 
ccaggacact gccatgccaa tgccccctca 
gccccagcct ctatggtgaa gacatacttg 
atcagtgctc gaaggcaagg ttatttctaa 
tgcaccccac accactgtgc aggtgtgacc 
ccagcccact taatcatcac agctcgacag 
taaaaagggg catcaccgtt cctgggtaac 
gttctctcca gcacctccca acccactagt 
ccaccatgtc tcgccagtca agtgtgtctt 
ccgcctctgc catcaccccg tctgtctccc 
ggggtggcgg tggtggtggc ttcggcaggg 
gctatggcag ccggagcctc tacaacctgg 
gtggtggcag cttcaggaac cggtttggtg 
gtggtgccgg tagtggattt ggtttcggcg 
gcggagctgg ctttggaggt ggcttcggtg 
gtatccaaga ggtcactgtc aaccagagtc 
ccagcatcca gagggtgagg accgaggagc 
ttgcctcctt catcgacaag gtgcggttcc 
agtggaccct gctgcaggag cagggcacca 
tcgagcagta catcaacaac ctcaggaggc 
gcctggactc agagctgaga aacatgcagg 
aggatgaaat caacaagcgt accactgctg 
tagatgctgc ctacatgaac aaggtggagc 
agattaactt catgaagatg ttctttgatg 
ctgacacctc agtggtcctc tccatggaca 
tcgctgaggt caaggcccag tatgaggaga 
cctggtatca gaccaagtat gaggagctgc 
tccgcaacac caagcatgag atctctgaga 
agattgacaa tgtcaagaaa cagtgcgcca 
agcgtgggga gctggccctc aaggatgcca 
tgcagaaggc caagcaggac atggcccggc 
ccaagctggc cctggacgtg gagatcgcca 
gcagactcag tggagaagga gttggaccag 
cctctggata tggcagtggc agtggctatg 
gcctcggtgg aggtcttgcc ggaggtagca 
gtgtcggcct aggtggtggg ctcagtgtgg 
gagggctggg ggtgggcttt ggcagtggcg 
ccaccacctc ctcctcccgg aagagcttca 
ttccaagtgc agcaacccag cccatggaga 
gttttatcct tttctggaga gtagtctaga 
ttcccaggag agccccattc ccagcccctg 
tcaaatcagc cttcaggttt cccacagcat 
tcccaaatct aaatcatcaa aacagaatcc 
taactacctc cagaatgtgt tcaataaaat 
gttttttttt tctacccaa 



agtcctgccc aggccaaaga gcaaggaaaa 60 
gaggagctat ggaaggataa acctggcctt 120 
tgagggcttc ccacattctc tggcctaaac 180 
ggatagctgt gccttcccta acaaaaaaat 24 0 
ctctagcaca taaccaagaa tataaggcta 300 
cagcaacctg ctgcctggaa aagtgtaaga 360 
ctgatggaca gcttccccaa gctccaaggg 420 
gttgtcaata ctcctggtcc tgtaagagtc 480 
gttcctggca tcctttttgg gctgctcaca 540 
ctagcagcgt caccaacttg ttgccaagag 600 
ctgagcagag cctgccagga agaaagcgtt 660 
ggtgagctca cagctgcccc ccaggcatgc 720 
ctctctcgcc cagcccagtt ctggaaggga 780 
agagccacct tctgcgtcct gctgagctct 840 
gcctggttct cttgctccac caggaacaag 900 
ccggagcggg gggcagtcgt agcttcagca 960 
gcaccagctt cacctccgtg tcccggtccg 1020 
tcagccttgc gggtgcttgt ggagtgggtg 1080 
ggggctccaa gaggatatcc atcagcacta 1140 
ctggtgctgg aggcggctat ggctttggag 1200 
gtggagctgg tggtggcttt gggctcggtg 1260 
gccctggctt tcctgtctgc cctcctggag 1320 
tcctgactcc cctcaacctg caaatcgacc 1380 
gcgagcagat caagaccctc aacaataagt 14 40 
tggagcagca gaacaaggtt ctggaaacaa 1500 
agactgtgag gcagaacctg gagccgttgt 15 60 
agctggacag catcgtgggg gaacggggcc 1620 
acctggtgga agacttcaag aacaagtatg 1680 
agaatgagtt tgtgatgctg aagaaggatg 1740 
tggaggccaa ggttgatgca ctgatggatg 1800 
cggagctgtc ccagatgcag acgcatgtct 1860 
acaaccgcaa cctggacctg gatagcatca 1920 
ttgccaaccg cagccggaca gaagccgagt 1980 
agcagacagc tggccggcat ggcgatgacc 2040 
tgaaccggat gatccagagg ctgagagccg 2100 
atctgcagaa cgccattgcg gatgccgagc 2160 
ggaacaagct ggccgagctg gaggaggccc 2220 
tgctgcgtga gtaccaggag ctcatgaaca 2280 
cttaccgcaa gctgctggag ggcgaggaat 2340 
tcaacatctc tgttgtcaca agcagtgttt 2400 
gcggtggcct cggtggaggt cttggcggcg 2460 
gtggaagcta ctactccagc agcagtgggg 2520 
ggggctctgg cttcagtgca agcagtagcc '2580 
ggggtagcag ctccagcgtc aaatttgtct 2640 
agagctaaga acctgctgca agtcactgcc 2700 
ttgcctcttc taggcagttg ctcaagccat 27 60 
ccaagccaat tgcagaacca cattctttgg 2820 
gtctcccgtg ccgcagttct atattctgct 2880 
ggcccctgct gacacgagaa cccaaagttt 2940 
ccaccccaat cccaaatttt gttttggttc 3000 
gttttataat ataagctggt gtgcagaatt 3060 

3079 



<210> 117 
<211> 6921 
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<212> DNA 

<213> Homo sapiens 

<400> 117 

gaattctgac tgtccactca aaacttctat 
aattgagata accatttaca aagacgatga 
taaatggaag gtcattagtc ctactgggaa 
cgttcctcca ccaaacaaag aagcggtgga 
gaatgtcctg actctttggc atgagtctca 
ttatctcatc aatgaaattg atagaattcg 
gctacctggt gaacatcagc aagttctaag 
ggaagatagc caggaatccc aagtcttttc 
ggttaatgta tgtaagcagt attatcaaga 
agaggaatca gtttataatc tctacatctc 
gaactgtgaa gatcggctga ttagacagat 
tgaaagtgtg ttcagaatca cagaacagga 
agatgatttg ggaacaatca caaataagtg 
ttcatcagtc cctaccctac gatcagagct 
ctattctatg tcttccactt acatagataa 
cactcaagct gcagaagccc tcgtaaaact 
agttatagct gacaagaata atattgagaa 
tgaagtagat gaaaagagac aggtattcca 
agccatcagt gatgaaatgt ttaaaacgta 
caaagaaaaa gcagatcaat tagttgaaag 
caggttacgg gacttagagg gcattggcaa 
tcctttagat gattggatcc agcaggttga 
gcctgaaaat agtaaaaccc tagccacaca 
aatagaaatg aaacagagca aaatggacga 
tacagtgaag gactatgaat tacaaacaat 
aaaatctcca gtgaaacgcc gaagaatgca 
catggaccta aggactcgat atactgccct 
tgctggtgat tcattgaaga ggctggaaga 
tgaacatggg gcatattcag atctgcttca 
caaacttaca ggaaagataa gtgagttgga 
gtcccgagta gaggaagaac ttccgaaggt 
gcagcagaga aatgtagaag atatctctct 
gcagtaccgc agggaacttg aaaccattgt 
ggagcgggtg aggcagctca ccatagaggc 
cctcctgaat tttcgcaatc agttggagga 
tcatcttaaa agaaaagatt taagtctcaa 
ggaagaatta agaagaaaga gagacaatga 
ggaaaaagac cttgcatttc agaaacaggt 
aattgaattg gaagcaagaa gaaaaataac 
attgccagtg tgtccgatca cacaggctac 
agaacatgac aagcagaaag cagaagaact 
caatagaaag gctgaacaag acatgagaga 
tgaaaaaacg tcatctgagg aaaaggctcg 
taatacactc agatgcctta agttggagct 
ttctcaacaa ctcagagagc ttggtaggca 
agccatgcaa gaagctagtg atctcaagaa 
atctcttaat catgaaaaag ggaaactaca 
tgctgtagct gagaagaata ttcagcattt 
gaaagaatta gaaagactac aaatctgcca 
tgagaaaagc catgagcagt tgcttcaaaa 
aatccaaagg ctcaatgaag aattggagaa 
aaaagtagag gagcttacta ggcagaataa 
ggcagaatca gagaatatag ttttagagaa 
gaaaattcag gcagatggtt ttaaagatca 



tccgatcaaa gctatctgtg actacagaca 60 
atgtgttttg gcgaataact ctcatcgtgc 120 
tgaggctatg gtcccatctg tgtgcttcac 180 
ccttgccaac agaattgagc aacagtatca 240 
cataaacatg aagagtgtag tatcctggca 300 
agctagcaat gtggcttcaa taaagacaat 360 
taatctacaa tctcgttttg aagattttct 420 
aggctcagat ataacacaac tggaaaagga 480 
acttcttaaa tctgcagaaa gagaggagca 540 
tgaagttcga aacattagac ttcggttaga 600 
tcgaactccc ctggaaagag atgatttgca 660 
gaaactaaag aaagagctgg aacgacttaa 720 
tgaggagttt ttcagtcaag cagcagcctc 780 
taatgtggtc cttcagaaca tgaaccaagt 840 
gttgaaaact gttaacttgg tgttaaaaaa 900 
ctatgaaact aaactgtgtg aagaagaagc 960 
tctaataagt actttaaagc aatggagatc 1020 
tgccttagag gatgagttgc agaaagctaa 1080 
taaagaacgg gaccttgatt ttgactggca 1140 
gtggcaaaat gttcatgtgc agattgacaa 1200 
atcactgaag tactacagag acacttacca 1260 
aactactcag agaaagattc aggaaaatca 1320 
gttgaatcaa cagaagatgc tggtgtccga 1380 
gtgtcaaaaa tatgcagaac agtactcagc 1440 
gacctaccgg gccatggtag attcacaaca 1500 
gagttcagca gatctcatta ttcaagagtt 1560 
ggtcactctc atgacacaat atattaaatt 1620 
ggaggagatt aaaaggtgta aggagacttc 1680 
gcgtcagaag gcaacagtgc ttgagaatag 1740 
aagaatggta gctgaactaa agaaacaaaa 1800 
cagggaggct gcagaaaatg aattgagaaa 1860 
gcagaagata agggctgaaa gtgaagccaa 1920 
gagagagaag gaagccgctg aaagagaact 1980 
cgaggctaaa agagctgccg tggaagagaa 204 0 
aaacaccttt accagacgaa cactggaaga 2100 
tgatttggag caacaaaaaa ataaattaat 2160 
ggaagaactc ttgaagctga taaagcagat 2220 
agcagagaaa cagttgaaag aaaagcagaa 2280 
tgaaattcag tatacatgta gagaaaatgc 2340 
atcatgcagg gcagtaacgg gtctccagca 2400 
caaacagcag gtagatgaac taacagctgc 2460 
gctgacatat gaacttaatg ccctccagct 2520 
tttgctaaaa gataaactag atgaaacaaa 2580 
ggaaaggaag gatcaggcgg agaaagggta 2640 
attgaatcaa accacaggta aagctgaaga 2700 
aataaagcgc aattatcagt tagaattaga 2760 
aagagaagta gacagaatca caagggcaca 2820 
aaattcacaa attcattctt ttcgagatga 2880 
gagaaaatca gatcatctaa aagaacaatt 2940 
tatcaaagct gaaaaagaaa ataatgataa 3000 
aagtaatgag tgtgcagaga tgctaaaaca 3060 
tgaaaccaaa ttaatgatgc agagaattca 3120 
acaaactatc cagcaaagat gtgaagcact 3180 
gctacgcagc acaaatgaac acttgcataa 3240 
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acagacaaaa acagagcagg attttcaaag 
gaaaagtcaa aatttggtaa gtgaatttaa 
ccagaatacc aagaaagaag ttagaaatct 
gaagcgacgc ggggagcaga aagttcagct 
caggttgaaa aaagtacaag acgaattaca 
cagaaagatg gttctgtttc aggaagaatc 
tcggaagaag atggaaaaat taatggagtc 
cattaggctt gactttgtgt ctcttcaaca 
gctttgtgaa acaaacatta aagaacttga 
gcagcaaggg cagcacatgg aagcaaatca 
gctgatagcc cagaagcgtg aggttgaaaa 
agagcatgaa catcaattag ttttgctcca 
agactgtacc ttcaaaccag attttgagat 
gctgtcctct agaaacactg gacaccttca 
gactcaagaa ccacagccat tggaagagaa 
caaagaagtc caattccagc caccaggggc 
ttactctgag tacttttctc agacaagcac 
ccccattaca agactgtctg aaattgagaa 
accacctgtt aggtatcaag ataacgcatg 
cttagagata gctaagaaca agcagtatga 
agaaaagaac ccagttccca gtgctgaaga 
tggactcaag aaaggggatt tccttaagaa 
tgatggtgat catgcatgtt cagtcaggga 
cactgtgact gccaggcagt tggtggaagc 
gctgcgactc ggtcttaaga ctgttgaaga 
gaaagccacc tcaattgcag ggctttacct 
ctcagcggcc gagagaatca taatagacaa 
ggctgcaaca ggttttataa ttgatcccat 
agttcttaaa ggagttgttg accccgaatt 
agctgtggga tattcttatt cttctaagac 
aatgcttgac agacaaaaag gtaaacatat 
cattgaccct gtgagaggca ttcgtgttcc 
gaataatgcc atcttacagt ttttacatga 
tcccaataac aagcaagctc tgtattactc 
agagtcccaa tgctttctgt ttccatttgg 
gaaaacacat agaatttctg tagtagatac 
ggctttccag agaaacctga ttgagaaaag 
tcagtggaag gaagctatgt tttttgaatc 
tactaaaaca ggattacact tcaatattaa 
agccttggtc aaaaagtatc aggaaggcct 
gctgagccgg ttagtcccca agaaagattt 
tgctagtggg gaaaggatct ctgtactaaa 
tactgccctc cgatgccttg aagcccaagt 
tggcaaaaag taccgggtgg ccgaagcttt 
ccagcagctg cgacagtgtg aattagtaat 
aatgatgtca gtggtggaag ctgtgaatgc 
atgtttggaa tttcagtact tgacaggagg 
atcaatagaa gaggctctcc aagtaggtat 
agatcaaaag tcatatgtca gaaatataat 
taaagaagcc ttagaaaaag ctgattttga 
atctgagccc ctgatgacag gaatttctag 
taaataactg tgcaaggggt gatgcaggct 
tatcggctac atatgcagtc tgtgaattat 
attgctaagt gctcaaaata gagtaagttt 
cttcaaatgg tttcatttag ccttgagaat 
tttttttttt tttacgtaga atgtgggata 
tttcttcaga actccccttc attgaatagt 
ctgaaagagc acgtcatgaa gcaccatgga 



aaaaattaaa tgcctagaag aagacctggc 3300 
gcaaaagtgt gaccaacaga acattatcat 3360 
gaatgcggaa ctgaatgctt ccaaagaaga 3420 
acaacaagct caggtgcaag agttaaataa 3480 
cttaaagacc atagaggagc agatgaccca 3540 
tggtaaattc aaacaatcag cagaggagtt 3600 
caaagtcatc actgaaaatg atatttcagg 3660 
agaaaactct agagcccaag aaaatgctaa 3720 
aagacagctt caacagtatc gtgaacaaat 3780 
ttaccaaaaa tgtcagaaac ttgaggatga 3840 
cctgaagcaa aaaatggacc aacagatcaa 3900 
gtgtgaaatt caaaaaaaga gcacagccaa 3960 
gacagtgaag gagtgccagc actctggaga 4020 
cccaacaccc agatcccctc tgttgagatg 4080 
gtggcagcat cgggttgttg aacagatacc 4140 
tccactcgag aaagagaaaa gccagcagtg 4200 
cgagttacag ataacttttg atgagacaaa 4260 
gataagagac caagccctga acaattctag 4320 
tgaaatggaa ctggtgaagg ttttgacacc 4380 
tatgcataca gaagtcacaa cattaaaaca 4440 
atggatgctt gaagggtgca gagcatctgg 4500 
gggcttagaa ccagagacct tccagaactt 4560 
tgatgaattt aaattccaag ggcttaggca 4620 
taagcttctg gacatgagaa caattgagca 4680 
agttcagaaa actcttaaca agtttctgac 4740 
agaatctaca aaagaaaaga tttcatttgc 4800 
aatggtggct ttggcatttt tagaagctca 4860 
ttcaggtcag acatattctg ttgaagatgc 4920 
cagaattagg cttcttgagg cagagaaggc 4980 
attgtcagtg tttcaagcta tggaaaatag 5040 
cttggaagcc cagattgcca gtgggggtgt 5100 
tccagaaatt gctctgcagc aggggttgtt 5160 
gccatccagc aacacaagag ttttccctaa 5220 
agaattactg cgaatgtgtg tatttgatgt 5280 
ggagaggaac atttccaatc tcaatgtcaa 5340 
taaaacagga tcagaattga ccgtgtatga 5400 
tatatatctt gaactttcag ggcagcaata 5460 
ctatgggcat tcttctcata tgctgactga 5520 
tgaggctata gagcagggaa caattgacaa 5580 
catcacactt acagaacttg ctgattcttt 5640 
gcacagtcct gttgcagggt attggctgac 5700 
agcctcccgt agaaatttgg ttgatcggat 5760 
cagtacaggg ggcataattg atcctcttac 5820 
gcatagaggc ctggttgatg aggggtttgc 5880 
cacagggatt ggccatccca tcactaacaa 5940 
aaatattata aataaggaaa tgggaatccg 6000 
gttgatagag ccacaggttc actctcggtt 6060 
tatagatgtc ctcattgcca caaaactcaa 6120 
atgccctcag acaaaaagaa agttgacata 6180 
tttccacaca ggacttaaac tgttagaagt 6240 
cctctactat tcttcctaat gggacatgtt 6300 
ggttcatgcc actttttcag agtatgatga 6360 
gtaacatact ctatttcttg agggctgcaa 6420 
taaattgaaa attacataag atttaatgcc 648 0 
ggttttttga aacttggcca cactaaaatg 6540 
aacttgatga actccaagtt cacagtgtca 6600 
gatcatttat taaatgataa attgcactcg 6660 
atcaaagaga aagatataaa ttcgttccca 6720 
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cagccttcaa gctgcagtgt tttagattgc 
gatatagtga ccttctttgc atattaaaat 
tcttcgcact tgaaagctaa cattatgaat 
ttcattctgt gtattttccg g 



ttcaaaaaat gaaaaagttt tgcctttttc 6780 
gtttaccaca atgtcccatt tctagttaag 6840 
attatgtgtt ggaggagggg aaggattttc 6900 

6921 



<210> 118 
<211> 946 
<212> DNA 
<213> Homo sapiens 



<400> 118 

cttctgactg ggctcaggct gacaggtaga 
ctccccatca cagctgtggt gcagtccacc 
gtcggcagtg gcttaggcct gggtggagga 
gttggaggtg gcttcagttc cagcagtggc 
ggaggcggca gttccaccat caagtacacc 
aagcactaaa gtgcgtctgc tagctctcgg 
cagagccctc tcctcaggtt gcctgtcctc 
ggtagagctg gggatgaatg cttagtgccc 
gagcacccat tgctcaccat cagatcaacc 
tggagcttca ctgttactaa attattaatt 
gagcattata agaaaatgac ctctgctcct 
ttcagaacaa cttccactta ctttccactg 
tgaaccccca cccaggcagt atccatgaaa 
gcctgtatct ctgtgatgat ttctgtgctc 
atttataata catatattct tttactttgc 
acttttttat ctgataagtg aatagttgtt 

<210> 119 

<211> 8948 

<212> DNA 

<213> Homo sapiens 



gctcaccatg gcttcttgtg tccttgtccc 60 
gtctccagtg gctatggcgg tgccagtggt 120 
agcagctact cctatggcag tggtcttggc 180 
agagccattg ggggtggcct cagctctgtt 240 
accacctcct cctccagcag gaagagctat 300 
tcccacagtc ctcaggcccc tctctggctg 360 
tcctggcctc cagtctcccc tgctgtccca 420 
tcacttcttc tctctctctc tataccatct 480 
tctgatttta catcatgatg taatcaccac 540 
tcttgcctcc agtgttctat ctctgaggct 600 
tttcattgca gaaaattgcc aggggcttat 660 
gctctcaaac tctctaactt ataagtgttg 720 
gcacaagtga ctagtcctat gatgtacaaa 780 
ttcactgttt gcaattgcta aataaagcag 840 
cttgctttgg ggccaaagtt ttgggcttaa 900 
tttaaaagat aatcta 94 6 



<400> 119 

tcaacagccc ctgctccttg ggcccctcca 
acaccaacac ccagctccga cgcagctcct 
tttcctcccg ctcctgcccc cggcccgtcg 
ggcccaggta gcgagcagcg acctcgcgag 
gtccgcctat ccttggcccc ctccgctttc 
gcgctgagcc gctctcccga ttgcccgccg 
ggatcaacac tctgggccgc atgatccgcg 
tgaccagcgg cggcgggggc accagcagga 
accagaactc ggacggctac tgtcaaaccg 
ccatccagga gctgctgcag aactgctccg 
agcctgaatt gaagtatgga gatggaatac 
gttttgccca ggccaatgac caaatggaaa 
agatgggcca gccctgtgat gcttaccaga 
gagcccttta taaagccatc agtgtccctc 
gaggctacac ttgtcagagt ggctctggct 
aatgtttggg gtggatgagg cagcaaaggg 
acctggcctc agtggagcag cacattaaca 
actatcgctg gcagctggac aaaatcaaag 
agttggagga ggagtatgaa aacctgctga 
gacagctgca gaacatcatt caggccacgt 
aggaggagga gctgctgtac gactggagcg 
aggccttctc catacgcatg agtcaactgg 



tgccatgccg taatctctcc cacccgacca 60 
ctgcgccctt gccgccctcc gagccacagc 120 
ccgtctccgc gctcgcagcg gcctcgggag 180 
ccttccgcac tcccgcccgg ttccccggcc 240 
tccgcgccgg cccgcctcgc ttatgcctcg 300 
acatgagctg caacggaggc tcccacccgc 360 
ccgagtctgg cccggacctg cgctacgagg 420 
tgtactattc tcggcgcggc gtgatcaccg 480 
gcacgatgtc caggcaccag aaccagaaca 540 
actgcttgat gcgagcagag ctcatcgtgc 600 
aactgactcg gagtcgagaa ttggatgagt 660 
tcctcgacag cttgatcaga gagatgcggc 720 
aaaggcttct tcagctccaa gagcaaatgc 780 
gagtccgcag ggccagctcc aagggtggtg 840 
gggatgagtt caccaaacat gtcaccagtg 900 
cggagatgga catggtggcc tggggtgtgg 960 
gccaccgggg catccacaac tccatcggcg 1020 
ccgacctgcg cgagaaatct gcgatctacc 1080 
aagcgtcctt tgagaggatg gatcacctgc 1140 
ccagggagat catgtggatc aatgactgcg 12 00 
acaagaacac caacatcgct cagaaacagg 1260 
aagttaaaga aaaagagctc aataagctga 1320 
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aacaagaaag tgaccaactt gtcctcaatc 
atatggacac tctgcagacg cagtggagtt 
ttcatctgaa agaaaatgct gcctactttc 
catacctgaa ggggctccag gactccatca 
ccctgcagca cctgctggaa cagatcaagg 
aatacaagcg tcaggtgcag aacttggtaa 
ctcgtaaccc agactacaga agcaataaac 
aacaagatca gaaaatcgtg cataaggggg 
gcagcaagtg gtacgtgacg ggcccgggag 
tgatcatccc tcctccgaac ccactggccg 
acgaagccat cttggctctg tggaaccagc 
ggcactactg catgattgac atagagaaga 
caatgcggca ggaagattac atgaagacga 
tcatcagaaa tagccaaggc tcagagatgt 
ctcagttcac cgatgcccag aagcattacc 
cccagcacca gacagtgacc acaactgaaa 
accataataa agtaattgaa accaacagag 
tggagctgca gaagattcgc aggcagatag 
acctccctct agcagaccag gggtcttctc 
agagtgtgca gaatgattca caagcaattg 
ttgccaactt cagaggttct gaaaagtact 
ttcagaaact ggaaaatatc aatggtgtta 
taagggcact gctccaggct attctccaaa 
ggctcactga ggaggaaact gtctgcctgg 
gactgaagaa aataaaaaat gacttgaact 
cagaactaca gaaagcccag cagatccact 
atctggactt gggcaagttc ggtgaaaaag 
tagataaaca gatcgacttt agattatggg 
attatcgtga taactatcag gctttctgca 
attccttaga atccatgaaa tttggagatt 
agaagaactt gcacagtgaa atatctggca 
ttgctgaact ttgcgccaat tcaattaagg 
caggactgga aactctgctg aacataccta 
gggtgattct gcaagaggct gcagatgttc 
ctggagacta ttacaggttc ttaagtgaga 
aaaataccaa gatcgaagtt ttggaagagg 
aaaactgtaa taagaacaaa ttcctggatc 
cccagttcaa agcgaagctt gcgagcctgg 
ggaagtcggc taagcaaaat ctagacaagt 
agatcacccg actgacttat gagattgaag 
acagatttga ccaacagaag aatgactatg 
aggagaacct tggttggcag aaattagagt 
agattgaaag gttgagggtt ctactgcagg 
atgagctggc aaaggtaaga aaccactata 
atgaaacaga gattaacatt acgaagacca 
atgattccaa aaatcttaga aaccagcttg 
aggatgaaat tgtcaggctc aatgacagca 
ctgaagaaaa cgcccttcag caaaaggcct 
atctggagat agaactgaag caggtcatgc 
agcagtccct ggaggaggct gccaagacca 
tcaaagctga gtttcaggag gaggccaagc 
aggtaagaaa caattatgat gaggagatca 
tcaacatcac caagaccacc atccaccagc 
gctaccgggc tcagatagac aatctcaccc 
agaggctgaa gaacactcta acccagacca 
tccaacagca aaaggccact ggctctgagg 
agctgagaca agtcactcag atgcgaacag 
atgatgctgc caaaaccatc caggataaaa 



53 



agcatccagc ttcagacaaa attgaggcct 1380 
ggattcttca gatcaccaag tgcattgatg 14 40 
agttttttga agaggcgcag tctactgaag 15 00 
ggaagaagta cccctgcgac aagaacatgc 1560 
agctggagaa agaacgagag aaaatccttg 1620 
acaagtctaa gaagattgta cagctgaagc 1680 
ccattattct cagagctctc tgtgactaca 17 40 
atgagtgtat cctgaaggac aacaacgagc 18 00 
gcgttgacat gcttgttccc tctgtggggc 1860 
tggacctctc ttgcaagatt gagcagtact 1920 
tctacatcaa catgaagagc ctggtgtcct 1980 
tcagggccat gacaatcgcc aagctgaaaa 2040 
tagccgacct tgagttacat taccaagagt 2100 
ttggagatga tgacaagcgg aaaatacagt 2160 
agaccctggt cattcagctc cctggctatc 2220 
tcactcatca tggaacctgc caagatgtca 2280 
aaaatgacaa gcaagaaaca tggatgctga 2340 
agcactgcga gggcaggatg actctcaaaa 24 00 
accacatcac agtgaaaatt aacgagctta 24 60 
ctgaggttct caaccagctt aaagatatgc 2520 
gctatttaca gaatgaagta tttggactat 2580 
cagatggcta cttaaatagc ttatgcacag 2640 
cagaagacat gttaaaggtt tatgaagcca 2700 
acctggataa agtggaagct taccgctgtg 2760 
tgaagaagtc gttgttggcc actatgaaga 2820 
ctcagacttc acagcagtat ccactttatg 2 8 80 
tcacacagct gacagaccgc tggcaaagga 2940 
acctggagaa acaaatcaag caattgagga 3000 
agtggctcta tgatcgtaaa cgccgccagg 30 60 
ccaacacagt catgcggttt ttgaatgagc 3120 
aacgagacaa atcagaggaa gtacaaaaaa 3180 
attatgagct ccagctggcc tcatacacct 32 40 
tcaagaggac catgattcag tccccttctg 3300 
atgctcggta cattgaacta cttacaagat 3360 
tgctgaagag tttggaagat ctgaagctga 34 20 
agctcagact ggcccgagat gccaactcgg 3480 
agaacctgca gaaataccag gcagagtgtt 3540 
aggagctgaa gagacaggct gagctggatg 3600 
gctacggcca aataaaagaa ctcaatgaga 3660 
atgaaaagag aagaagaaaa tctgtggaag 3720 
accaactgca gaaagcaagg caatgtgaaa 3780 
ctgagaaagc catcaaggag aaggagtacg 3840 
aagaaggcac ccggaagaga gaatatgaaa 3900 
atgaggagat gagtaattta aggaacaagt 3960 
ccatcaagga gatatccatg caaaaagagg 4020 
atagactttc aagggaaaat cgagatctga 4080 
tcttgcaggc cactgagcag cgaaggcgag 4140 
gtggctctga gataatgcag aagaagcagc 4200 
agcagcgctc tgaggacaat gcccggcaca 42 60 
ttcaggacaa aaataaggag atcgagagac 4320 
gccgctggga atatgaaaat gaactgagta 4380 
ttagcttaaa aaatcagttt gagaccgaga 44 40 
tcaccatgca gaaggaagag gataccagtg 4500 
gagaaaacag gagcttatct gaagaaataa 4560 
cagagaatct caggagggtg gaagaagaca 4620 
tgtctcagag gaaacagcag ctggaggttg 4 680 
aggagagcgt aagatataag caatctcttg 4740 
acaaggagat- agaaaggtta aaacaactga 4800 
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tcgacaaaga aacaaatgac cggaaatgcc 
tccagtatga cctgcagaaa gcaaacagta 
ttcaggagca agaactgaca cgcctgagga 
ctgtgaagga ccaggatatc acgcggttcc 
agcagaaggt ggaagaggag ctgaatcggc 
agaggaagaa gctggaggaa gagctggaag 
tcaaaatcac caacctgacc cagcagctgg 
aggatgacct ccggcagcag agggacgtgc 
cccaggaaga gctgaggagg ctctcttctg 
aggaacagga aagtgtcaaa caagctcact 
aagataaaag cagaagctta aatgaaagca 
cagagaacct gaccaaggag cacttgatgt 
agtacgatga cctgaggaga ggacgaagcg 
tggaactaag gagccagctg cagatcagca 
ttaatgattt acagagagag agggaaaatt 
aggctttaga ggcatctaat aggattcagg 
aggaaagaga gagccttctg gtgaaaatca 
agaggctgga ggatgagctg aatcgtgcaa 
aacagcgcct ggagtgtgag aaacagcaaa 
aatattcccg caaggaggag gctattagga 
gagagaagaa cagtcttagg agtgagatcg 
aagagaggtg caggcgtaag ctggaggatt 
cagaacgctc ccgatatcag agggagattg 
atcgagagac ccagactgag tgtgagtgga 
ggctgaggaa gaaggtgaca gcaatgcagc 
ccttggacaa actattgaag gggaagaagt 
cattccttcg gggtgcagga tctatcgctg 
ctttggtaga ggccaagaga aagaaattaa 
aggcccaggc agctacaggt ggtataattg 
acagtgccat agctcgggac ctcattgact 
aaaaagctat cactggtttt gatgatccat 
ccatcaagaa aaatttgatt gatagagaaa 
cttcaggggg tgtagtagac cctgtgaaca 
cccgggggct gattgataga gatttgtatc 
aaaactttgt ggatccagtc accaaaaaga 
gcagaatcga accacatact ggtctgctct 
tccaaggaat cagacaacct gtgaccgtca 
cgtccactgt caatgaactg gaatctggtc 
ttaaggactt cctccagggt tcaagctgca 
agaagcttgg catttatgag gccatgaaaa 
agttgctgga agcccaagca gctactggct 
taccagtgga ggaagcctac aagagaggtc 
tgtctgcaga acgagctgtc actgggtata 
tgttccaagc catgaataag gaactcatcg 
cacagatcgc aaccgggggg atcattgacc 
tagcatataa gaggggctat ttcaatgagg 
atgataccaa aggatttttt gaccccaaca 
aagaaagatg cattaaggat gaggaaacag 
agaaacaggt gcagacatca caaaagaata 
acccagaaac caataaagaa atgtctgttc 
atgaaacctt caaagaactg tgtgagcagg 
gatcagatgg ctccaccagg gtggtcctgg 
ttcaagatgc tattgacaag ggccttgttg 
gcagcctcag cctcactcaa tttgctgaca 
gcagcagcat gggcagtggt gtcagcgatg 
taagtaagat ttccaccata tccagcgtca 
cagacaccct ggaagaatcg agccccattg 
aaatctccat tacagaaggt atagagcggg 



tggaagatga aaacgcgaga ttacaaaggg 48 60 
gtgcgacgga gacaataaac aaactgaagg 4920 
tcgactatga aagggtttcc caggagagga 4980 
agaactctct gaaagagctg cagctgcaga 5040 
tgaagaggac cgcgtcagaa gactcctgca 5100 
gcatgaggag gtcgctgaag gagcaagcca 5160 
agcaggcatc cattgttaag aagaggagtg 5220 
tggatggcca cctgagggaa aagcagagga 5280 
aggtcgaggc cctgaggcgg cagttactcc 5340 
tgaggaatga gcatttccag aaggcgatag 5400 
aaatagaaat tgagaggctg cagtctctca 5460 
tagaagaaga actgcggaac ctgaggctgg 5520 
aagcggacag tgataaaaat gcaaccatct 5580 
acaaccggac cctggaactg caggggctga 5640 
tgagacagga aattgagaaa ttccaaaagc 5700 
aatcaaagaa tcagtgtact caggtggtac 5760 
aagtcctgga gcaagacaag gcaaggctgc 5820 
aatcaactct agaggcagaa accagggtga 5880 
ttcagaatga cctgaatcag tggaagactc 5940 
agatagaatc ggaaagagaa aagagtgaga 6000 
aaagactcca agcagagatc aagagaattg 6060 
ctaccaggga gacacagtca cagttagaaa 6120 
ataaactcag acagcgccca tatgggtccc 6180 
ccgttgacac ctccaagctg gtgtttgatg 6240 
tctatgagtg tcagctgatc gacaaaacaa 6300 
cagtggaaga agttgcttct gaaatccagc 6360 
gagcatctgc ttctcctaag gaaaaatact 6420 
tcagcccaga atccacagtc atgcttctgg 6480 
atccccatcg gaatgagaag ctgactgtcg 6540 
tcgatgaccg tcagcagata tatgcagcag 6600 
tttcaggcaa gacagtatct gtttcagaag 6660 
ccggaatgcg cctgctggaa gcccagattg 6720 
gtgtcttttt gccaaaagat gtcgccttgg 6780 
gatccctgaa tgatccccga gatagtcaga 6840 
aggtcagtta cgtgcagctg aaggaacggt 6900 
tgctttcagt acagaagaga agcatgtcct 6960 
ctgagctagt agattctggt atattgagac 7020 
agatttctta tgacgaggtt ggtgagagaa 7080 
tagcaggcat atacaatgag accacaaaac 7140 
ttggcttagt ccgacctggt actgctctgg 7200 
ttatagtgga tcctgttagc aacttgaggt 7260 
tggtgggcat tgagttcaaa gagaagctcc 7320 
atgatcctga aacaggaaac atcatctctt 7380 
aaaagggcca cggtattcgc ttattagaag 7440 
caaaggagag ccatcgttta ccagttgaca 7500 
aactcagtga gattctctca gatccaagtg 7560 
ctgaagaaaa tcttacctat ctgcaactaa 7620 
ggctctgtct tctgcctctg aaagaaaaga 7680 
ccctcaggaa gcgtagagtg gtcatagttg 7740 
aggaggccta caagaagggc ctaattgatt 7800 
aatgtgaatg ggaagaaata accatcacgg 7860 
tagatagaaa gacaggcagt cagtatgata 7920 
acaggaagtt ctttgatcag taccgatccg 7980 
tgatctcctt gaaaaatggt gtcggcacca 8040 
atgtttttag cagctcccga catgaatcag 8100 
ggaatttaac cataaggagc agctcttttt 8160 
cagccatctt tgacacagaa aacctggaga 8220 
gcatcgttga cagcatcacg ggtcagaggc 8280 
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ttctggaggc tcaggcctgc acaggtggca 
cacttcagga cgcagtctcc cagggtgtga 
ctgctcagaa agccttcata ggcttcgagg 
cagaggcagt gaaagaaaaa tggctcccgt 
agtacctcac gggaggtctt gttgacccgg 
ccatccggaa ggggttcata gatggccgcg 
atgccaaaat cctgacctgc cccaaaacca 
atcgctccat ggtagaagat atcactgggc 
ccaagggctt acccagccct tacaacatgt 
gctcgggatc tcgctccgga tctcgctccg 
ttgacgccac agggaattct tcctactctt 
ggcactag 



tcatccaccc aaccacgggc cagaagctgt 8340 
ttgaccaaga catggccacc agcgtgaagc 8400 
gtgtgaaggg aaagaagaag atgtcagcag 84 60 
atgaggctgg ccagcgcttc ctggagttcc 8520 
aagtgcatgg gaggataagc accgaagaag 8580 
ccgcacagag gctgcaagac accagcagct 8640 
aattaaaaat atcctataag gatgccataa 8700 
tgcgccttct ggaagccgcc tccgtgtcgt 8760 
cttcggctcc ggggtcccgc tccggctccc 8820 
ggtcccgcag tgggtcccgg agaggaagct 8880 
attcctactc atttagcagt agttctattg 8940 

8948 



<210> 120 
<211> 587 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 91, 131, 256, 263, 332, 392, 400, 403, 461, 496, 497, 499, 
510, 511, 518, 519, 539, 554, 560, 576 
<223> n = A,T,C or G 

<400> 120 

cgtcctaagc acttagacta catcagggaa gaacacagac cacatccctg tcctcatgcg 60 
gcttatgttt tctggaagaa agtggagacc nagtccttgg ctttagggct ccccggctgg 120 
gggctgtgca ntccggtcag ggcgggaagg gaaatgcacc gctgcatgtg aacttacagc 180 
ccaggcggat gccccttccc ttagcactac ctggcctcct gcatcccctc gcctcatgtt 240 
cctcccacct tcaaanaatg aanaacccca tgggcccagc cccttgccct ggggaaccaa 300 
ggcagccttc caaaactcag gggctgaagc anactattag ggcaggggct gactttgggt 360 
gacactgccc attccctctc agggcagctc angtcacccn ggnctcttga acccagcctg 420 
ttcctttgaa aaagggcaaa actgaaaagg gcttttccta naaaaagaaa aaccagggaa 480 
ctttgccagg gcttcnntnt taccaaaacn ncttctcnng gatttttaat tccccattng 540 
gcctccactt accnggggcn atgccccaaa attaanaatt tcccatc 587 

<210> 121 

<211> 619 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 260, 527, 560, 564, 566, 585, 599 
<223> n « A,T,C or G . 

<400> 121 

cactagtagg atagaaacac tgtgtcccga gagtaaggag agaagctact attgattaga 60 

gcctaaccca ggttaactgc aagaagaggc gggatacttt cagctttcca tgtaactgta 120 

tgcataaagc caatgtagtc cagtttctaa gatcatgttc caagctaact gaatcccact 180 

tcaatacaca ctcatgaact cctgatggaa caataacagg cccaagcctg tggtatgatg 240 

tgcacacttg ctagactcan aaaaaatact actctcataa atgggtggga gtattttggt 300 

gacaacctac tttgcttggc tgagtgaagg aatgatattc atatattcat ttattccatg 360 

gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 420 

tttccaaatt tttgtacagt cgctgcacat atttgaaatc atatattaag acttccaaaa 480 

aatgaagtcc ctggtttttc atggcaactt gatcagtaaa ggattcncct ctgtttggta 540 
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cttaaaacat ctactatatn gttnanatga aattcctttt ccccncctcc cgaaaaaana 600 
aagtggtggg gaaaaaaaa 619 

<210> 122 

<211> 1475 

<212> DNA 

<213> Homo sapiens 

<400> 122 

tccacctgtc cccgcagcgc cggctcgcgc cctcctgccg cagccaccga gccgccgtct 60 
agcgccccga cctcgccacc atgagagccc tgctggcgcg cctgcttctc tgcgtcctgg 120 
tcgtgagcga ctccaaaggc agcaatgaac ttcatcaagt tccatcgaac tgtgactgtc 180 
taaatggagg aacatgtgtg tccaacaagt acttctccaa cattcactgg tgcaactgcc 240 
caaagaaatt cggagggcag cactgtgaaa tagataagtc aaaaacctgc tatgagggga 300 
atggtcactt ttaccgagga aaggccagca ctgacaccat gggccggccc tgcctgccct 360 
ggaactctgc cactgtcctt cagcaaacgt accatgccca cagatctgat gctcttcagc 420 
tgggcctggg gaaacataat tactgcagga acccagacaa ccggaggcga ccctggtgct 480 
atgtgcaggt gggcctaaag ccgcttgtcc aagagtgcat ggtgcatgac tgcgcagatg 540 
gaaaaaagcc ctcctctcct ccagaagaat taaaatttca gtgtggccaa aagactctga 600 
ggccccgctt taagattatt gggggagaat tcaccaccat cgagaaccag ccctggtttg 660 
cggccatcta caggaggcac cgggggggct ctgtcaccta cgtgtgtgga ggcagcctca 720 
tcagcccttg ctgggtgatc agcgccacac actgcttcat tgattaccca aagaaggagg 780 
actacatcgt ctacctgggt cgctcaaggc ttaactccaa cacgcaaggg gagatgaagt 840 
ttgaggtgga aaacctcatc ctacacaagg actacagcgc tgacacgctt gctcaccaca 900 
acgacattgc cttgctgaag atccgttcca aggagggcag gtgtgcgcag ccatcccgga 960 
ctatacagac catctgcctg ccctcgatgt ataacgatcc ccagtttggc acaagctgtg 1020 
agatcactgg ctttggaaaa gagaattcta ccgactatct ctatccggag cagctgaaga 1080 
tgactgttgt gaagctgatt tcccaccggg agtgtcagca gccccactac tacggctctg 1140 
aagtcaccac caaaatgctg tgtgctgctg acccacagtg gaaaacagat tcctgccagg 1200 
gagactcagg gggacccctc gtctgttccc tccaaggccg catgactttg actggaattg 1260 
tgagctgggg ccgtggatgt gccctgaagg acaagccagg cgtctacacg agagtctcac 1320 
acttcttacc ctggatccgc agtcacacca aggaagagaa tggcctggcc ctctgagggt 1380 
ccccagggag gaaacgggca ccacccgctt tcttgctggt tgtcattttt gcagtagagt 1440 
catctccatc agctgtaaga agagactggg aagat 1475 



<210> 123 

<211> 2294 

<212> DNA 

<213> Homo sapiens 

<400> 123 

cagcgccggc tcgcgccctc ctgccgcagc 
gccaccatga gagccctgct ggcgcgcctg 
aaaggcagca atgaacttca tcaagttcca 
tgtgtgtcca acaagtactt ctccaacatt 
gggcagcact gtgaaataga taagtcaaaa 
cgaggaaagg ccagcactga caccatgggc 
gtccttcagc aaacgtacca tgcccacaga 
cataattact gcaggaaccc agacaaccgg 
ctaaagccgc ttgtccaaga gtgcatggtg 
tctcctccag aagaattaaa atttcagtgt 
attattgggg gagaattcac caccatcgag 
aggcaccggg ggggctctgt cacctacgtg 
gtgatcagcg ccacacactg cttcattgat 
ctgggtcgct caaggcttaa ctccaacacg 
ctaatcctac acaaggacta cagcgctgac 
ctgaagatcc gttccaagga gggcaggtgt 
tgcctgccct cgatgtataa cgatccccag 



caccgagccg ccgtctagcg ccccgacctc 60 
cttctctgcg tcctggtcgt gagcgactcc 120 
tcgaactgtg actgtctaaa tggaggaaca 180 
cactggtgca actgcccaaa gaaattcgga 240 
acctgctatg aggggaatgg tcacttttac 300 
cggccctgcc tgccctggaa ctctgccact 360 
tctgatgctc ttcagctggg cctggggaaa 420 
aggcgaccct ggtgctatgt gcaggtgggc 480 
Catgactgcg cagatggaaa aaagccctcc 540 
ggccaaaaga ctctgaggcc ccgctttaag 600 
aaccagccct ggtttgcggc catctacagg 660 
tgtggaggca gcctcatcag cccttgctgg 720 
tacccaaaga aggaggacta catcgtctac 780 
caaggggaga tgaagtttga ggtggaaaac 840 
acgcttgctc accacaacga cattgccttg 900 
gcgcagccat cccggactat acagaccatc 960 
tttggcacaa gctgtgagat cactggcttt 1020 
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ggaaaagaga attctaccga ctatctctat 
ctgatttccc accgggagtg tcagcagccc 
atgctgtgtg ctgctgaccc acagtggaaa 
cccctcgtct gttccctcca aggccgcatg 
ggatgtgccc tgaaggacaa gccaggcgtc 
atccgcagtc acaccaagga agagaatggc 
cgggcaccac ccgctttctt gctggttgct 
taagaagagc tgggaatata ggctctgcac 
gaacgacaat agctttaccc tcaggcatag 
gccaggatgg aggggtggtc ctgactcaac 
ggactgaagc ctgcaggagt taaaaagggc 
agagccagct cccccgaccg gtgggcattt 
tcccaattag gaagtgtaag cagctgaggt 
agcggtttgg ggagcagaga cactaacgac 
gtatcaggaa atatatatgt gtgtgtatgt 
agtgtgagta agagctggtg tctgattgtt 
ctgtgatgcc acacagagtg gtctttctgg 
gggtccccca cgtgacagtg cctgggaatg 
actgtctcag tttcactttc acatagatgt 
agcctagttc atccaatcct cactgggtgg 
tatatttcac tatttttatt tatatttttg 
tgatttttct gatg 

<210> 124 
<211> 956 
<212> DNA 
<213> Homo sapiens 

<400> 124 

gatgagttcc gcaccaagtt tgagacagac 
atcaatggcc tgcgcagggt gctggatgag 
cagattgaga acctcaagga ggagctggcc 
aacgccctgc gaggccaggt gggtggtgag 
gtggacctga gccgcatcct caacgagatg 
aaccgcaagg atgccgagga ttggttcttc 
gccaccaaca gtgagctggt gcagagtggc 
atgcaggcct tggagataga gctgcagtcc 
aacctggcgg agacagagaa ccgctactgc 
ggcagcgtgg aggagcagct ggcccagctt 
tacaaaatcc tgctggatgt gaagacgcgg 
ctgctggagg gagaggatgc ccacctgact 
caggtgcgta ccattgtgga agaggtccag 
gtccaccaga ccacccgctg aggactcagc 
cgcagccgcc ccatctgccc cacagtctcc 
tcccttcccc atgcttcctt gcctgatgac 

<210> 125 
<211> 486 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 16 

<223> n = A,T,C or G 



ccggagcagc tgaaaatgac tgttgtgaag 1080 
cactactacg gctctgaagt caccaccaaa 114 0 
acagattcct gccagggaga ctcaggggga 1200 
actttgactg gaattgtgag ctggggccgt 1260 
tacacgagag tctcacactt cttaccctgg 1320 
ctggccctct gagggtcccc agggaggaaa 1380 
attttgcagt agagtcatct ccatcagctg 1440 
agatggattt gcctgtgcca ccaccagggc 1500 
gcctgggtgc tggctgccca gacccctctg 1560 
atgttactga ccagcaactt gtctttttct 1620 
agggcatctc ctgtgcatgg gctcgaaggg 1680 
gtgaggccca tggttgagaa atgaataatt 174 0 
ctcttgaggg agcttagcca atgtgggagc 18 00 
ttcagggcag ggctctgata ttccatgaat 1860 
ttgcacactt gtgtgtgggc tgtgagtgta 1920 
aagtctaaat atttccttaa actgtgtgga 1980 
agaggttata ggtcactcct ggggcctctt 2040 
tattattctg cagcatgacc tgtgaccagc 2100 
ccctttcttg gccagttatc ccttcctttt 2160 
ggtgaggacc actcctgtac actgaatatt 2220 
taattttaaa taaaagtgat caataaaatg 228 0 

2294 



caggccctgc gcctgagtgt ggaggccgac 60 
ctgaccctgg ccagagccga cctggagatg 120 
tacctgaaga agaaccacga ggaggagatg 180 
atcaatgtgg agatggacgc tgccccaggc 240 
cgtgaccagt atgagaagat ggcagagaag 300 
agcaagacag aggaactgaa ccgcgaggtg 360 
aagagtgaga tctcggagct ccggcgcacc 420 
cagctcagca tgaaagcatc cctggagggc 480 
gtgcagctgt cccagatcca ggggctgatt 54 0 
cgctgcgaga tggagcagca gaaccaggaa 600 
ctggagcagg agattgccac ctaccgccgc 660 
cagtacaaga aagaaccggt gaccacccgt 720 
gatggcaagg tcatctcctc ccgcgagcag 780 
taccccggcc ggccacccag gaggcaggga 840 
ggcctctcca gcctcagccc cctgcttcag 900 
aataaaagct tgttgactca gctatg 956 



<400> 125 

aaattatata tagtgnttca gctcccattg tggtgttcat agtcttctag gaacagataa 60 
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acttaagtat tcaattcact cttggcattt 
ttggaaaact gcttttcttc tgagaacctt 
tctaagtcca gagctaactt agtactgttt 
tctgtttagc cagtgttacc aaggtaagct 
gcattttagg acattatggc agctttagaa 
agcgcaggtt ttggatacta gagaaagtca 
tctgatgtga attcaaattt tacctctgtt 
tttact 

<210> 126 
<211> 3552 
<212> DNA 

<213> Homo sapiens 
<400> 126 

cggcaggcag gtctcgtctc ggcaccctcc 
catcccgatg gccgccgctg ggccccggcg 
gctgctgacc ctcgtgatct tcagtcgtgc 
tgtaccttct aaactagagg cagacaaaat 
caggtctgca gacctcatcc ggtcaagtga 
agtgtacaca gccagggctg ttgcgctgtc 
ttctgacaaa aggaaacaga cacagaaaga 
ggtatcgaag acaagacaca ctagagaaac 
acctattcct tgctctatgc aagagaattc 
agttgaatct gatgcagcac agaactatac 
tgataaagaa cctttaaatt tgttttatat 
tcggcctgtg gatcgtgaag aatatgatgt 
agatggatat tcagcagatc tgcccctccc 
caaccaccct gttttcacag aagcaattta 
tggtactaca gtgggggtgg tttgtgccac 
gcgcctgaaa tacagcattt tgcagcagac 
tcccagcaca ggcgtaatca ccacagtctc 
gtactcattg ataatgaaag tacaagacat 
atcaacttgt atcataacag taacagattc 
tgcttatgaa gcatttgtag aggaaaatgc 
agaagataag gatttaatta acactgccaa 
aaatgaaaat ggacatttca aaatcagcac 
tgttgtaaag ccactgaatt atgaagaaaa 
caatgaagcg ccatttgcta gagatattcc 
tacagttcat gtgagggatc tggatgaggg 
gcggattaaa gaaaacttag cagtggggtc 
cgaaaataga aatggcaatg gtttaaggta 
caccattgat gaaatttcag ggtcaatcat 
aactcccaaa aatgagttgt ataatattac 
atgtactgga acacttgctg tgaacattga 
tcaagaatat gtagtcattt gcaaaccaaa 
tcctgatgaa cctgtccatg gagctccatt 
aatcagtaga ctgtggagcc tcaccaaagt 
gaaaaatgct ggatttcaag aatataccat 
agctgcaaca aaattattga gagttaatct 
tgcgacttca aggagtacag gagtaatact 
gggtatagca ctgctctttt ctgtattgct 
taaagggaaa cgttttcctg aagatttagc 
agcacctgga gacgatagag tgtgctctgc 
ctctagccaa ggtttttgtg gtactatggg 
cattgaaatg atgaaaggag gaaaccagac 
tcataccctg gactcctgca ggggaggaca 
ctcggagtgg cacagtttta ctcaaccccg 



tttctttaat ataggctttt tagcctattt 120 
attctgaatg tcatcaactt taccaaacct 180 
aagttactat tgactgaatt ttcttcattt 240 
ggggaatgaa gtataccaac ttctttcaga 300 
ggctgtcttg tttctagcca agggagagcc 360 
tttgcttgta ctattgccat tttagaaagc 420 
acttaaagcc aacaatttta aggcagtagt 480 

486 



cggcgcccgc gttctcctgg ccctgcccgg 60 
ctccgtgcgc ggagccgtct gcctgcatct 120 
tggtgaagcc tgcaaaaagg tgatacttaa 180 
aattggcaga gttaatttgg aagagtgctt 240 
tcctgatttc agagttctaa atgatgggtc 300 
tgataagaaa agatcattta ccatatggct 360 
ggttactgtg ctgctagaac atcagaagaa 420 
tgttctcagg cgtgccaaga ggagatgggc 480 
cttgggccct ttcccattgt ttcttcaaca 540 
tgtcttctac tcaataagtg gacgtggagt 600 
agaaagagac actggaaatc tattttgcac 660 
ttttgatttg attgcttatg cgtcaactgc 720 
actacccatc agggtagagg atgaaaatga 780 
taattttgaa gttttggaaa gtagtagacc 84 0 
agacagagat gaaccggaca caatgcatac 900 
accaaggtca cctgggctct tttctgtgca 960 
tcattatttg gacagagagg ttgtagacaa 1020 
ggatggccag ttttttggat tgataggcac 1080 
aaatgataat gcacccactt tcagacaaaa 1140 
attcaatgtg gaaatcttac gaatacctat 1200 
ttggagagtc aattttacca ttttaaaggg 1260 
agacaaagaa actaatgaag gtgttctttc 1320 
ccgtcaagtg aacctggaaa ttggagtaaa 1380 
cagagtgaca gccttgaaca gagccttggt 1440 
gcctgaatgc actcctgcag cccaatatgt 1500 
aaagatcaac ggctataagg catatgaccc 1560 
caaaaaattg catgatccta aaggttggat 1620 
aacttccaaa atcctggata gggaggttga 1680 
agtcctggca atagacaaag atgatagatc 1740 
agatgtaaat gataatccac cagaaatact 1800 
aatggggtat accgacattt tagctgttga 1860 
ttatttcagt ttgcccaata cttctccaga 1920 
taatgataca gctgcccgtc tttcatatca 1980 
tcctattact gtaaaagaca gggccggcca 2040 
gtgtgaatgt actcatccaa ctcagtgtcg 2100 
tggaaaatgg gcaatccttg caatattact 2160 
aactttagta tgtggagttt ttggtgcaac 2220 
acagcaaaac ttaattatat caaacacaga 2280 
caatggattt atgacccaaa ctaccaacaa 2340 
atcaggaatg aaaaatggag ggcaggaaac 2400 
cttggaatcc tgccgggggg ctgggcatca 24 60 
cacggaggtg gacaactgca gatacactta 2520 
tctcggtgaa aaattgcatc gatgtaatca 2580 
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gaatgaagac cgcatgccat cccaagatta 
atctccagct ggttctgtgg gctgctgcag 
tttaaataat ttggaaccca aatttattac 
tcacagtgct acaattaggt ctttgtcaga 
aaagttcaat ttcaacatgt atgtatatga 
ctcaccaatt tatattttta aagcaagttg 
ttaaaacaga caactggtaa atctcaaact 
ctgctctttt ttttttttac agatatttta 
aacaatagct aagttatgct aatatcacat 
aaaaataaac aagaaatatt gagtatcact 
gaagactgaa ttaaattaaa aatgttgcag 
actaccaaat tcatttgact ttggaggcaa 
ttttctatag gaatatagtt ggaaataaat 
aatatttaaa tgaaatgaga acaaagagga 
tatagtttgt cctacaatag aaaaaagaga 
gcattataac tgagtctatg aggaaatagt 
ttgtaaataa at 

<210> 127 
<211> 754 
<212> DNA 
<213> Homo sapiens 



tgtcctcact tataactatg agggaagagg 2640 
tgaaaagcag gaagaagatg gccttgactt 27 00 
attagcagaa gcatgcacaa agagataatg 2760 
cattctggag gtttccaaaa ataatattgt 2820 
tgattttttt ctcaattttg aattatgcta 2880 
ttgcttatct tttccaaaaa gtgaaaaatg 2940 
ccagcactgg aattaaggtc tctaaagcat 3000 
gtaataaata tgctggataa atattagtcc 3060 
tattatgtat tcactttaag tgatagttta 3120 
atgtgaagaa agttttggaa aagaaacaat 3180 
ctcataaaga attggactca cccctactgc 3240 
aatgtgttga agtgccctat gaagtagcaa 3300 
gtgtgtgtgt atattattat taatcaatgc 3360 
aaatggtaaa aacttgaaat gaggctgggg 3420 
gagcttccta ggcctgggct cttaaatgct 34 80 
tcctgtccaa tttgtgtaat ttgtttaaaa 3540 

3552 



<400> 127 

tttttttttt ttgtcattgt tcattgattt taatgagaaa gctaagagag gaaataagta 60 

gcctttcaaa ggtcacacag aagtaagtga cagatccagg attcatatcc aagcattctg 120 

gctctagtgt ccatgcttct caaccattat gacccaatat tcaaccaaat caatactgaa 180 

ggacacgtga aatgtatccg gtattttact attacaaaca aaaatccaat gaacattctt 240 

gaagacatac acaaaaataa tggttacaat agaagttact ggaattgaaa ttttggttca 300 

acctatatta aaatgtaagg cttttgatat agctaataga tttttgaaat gatcagtctt 360 

aacgtttgta ggggagcaca ctcctgcatg gggaaaagat tcactgtgaa gcacagagca 420 

cctttatggt tggatcatct tgtcattaaa gttcaggcgt tatctatcct gtaagtggca 480 

gaatcaagac tgcaatatcg cctgcttttc tttttaactc atgttttccc ttgactacac 540 

tggtcctcaa agtaaaaccc ctgtgtcagt gtactattca tggaatactc tgcaattata 600 

accaccttct aatactttta atacccaatc aaaatttatt atacatatgt atcatagata 660 

ctcatctgta aagctgtgct tcaaaatagt gatctcttcc caacattaca atatatatta .720 

atgatgtcga acctgcccgg gcggccgctc gaag 754 

<210> 128 
<211> 374 
<212> DNA 
<213> Homo sapiens 

<400> 128 

aggttttgat taaaaaggca aatgatttta ttgttcgata atcttttaaa aaaataagag 60 

gaaggagtaa aattaaagat gaaagatgat ttttatttcc ttgtgacctc tatatccccc 120 
ttcccctgcc cttggtaagt aactcttgat ggagaaagga ttaaagactc ttatttaacc 180 

aaaaaacaga gccagctaat catttccaaa ggttagtatc tccctgctga cctcttcttt 240 

ggtttaattg aataaaacta tatgttcata tatgtattaa aacaactcag aataacatct 300 

tttcttcctt agttaaggca ttataagggc tatactatca tccataataa ccaaggcaat 360 

aacttaaaaa gctg 374 

<210> 129 
<211> 546 
<212> DNA 
<213> Homo sapiens 



<400> 129 
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agtgtgatgg atatctgcag aattcgggct aagcgtggtc gcggcccgag gtctggaact 60 
tcccagcacy tgaaaaggag cctcctgagc tgactcggct aaagccccac tttcgctcct 120 
cctcatttct gcctactgat ttccttggag cattcatctg aatattaccg tttgctgtgt. 180 
aacctggtac atacatagca tgactccctg gaatagagtg ggctggggtg cttatgctgg 240 
gagagtgatt gacatgcact ttcaagctat atctaccatt tgcagcaaag gagaaaaaat 300 
acctcgagta aattccatca ttttttataa catcagcacc tgctccatca tcaaggagtc 360 
tcagcgtaac aggatctcca gtctctggct caactgtggc agtgacagtg gcattaagaa 420 
tgggataaaa tccctgtttc acattggcat aaatcatcac aggatgagga aaatggaggc 480 
tgtctctttc cacaaaggct tccacagtgg ctgggggcac agacctgccc gggcggccgc 540 
tcgaaa 546 

<210> 130 
<211> 5156 
<212> DNA 

<213> Homo sapiens 
<400> 130 

accaaccgag gcgccgggca gcgacccctg cagcggagac agagactgag cggcccggca 60 
ccgccatgcc tgcgctctgg ctgggctgct gcctctgctt gtcgctcctc ctgcccgcag 120 
cccgggccac ctccaggagg gaagtctgtg attgcaatgg gaagtccagg cagtgtatct 180 
ttgatcggga acttcacaga caaactggta atggattccg ctgcctcaac tgcaatgaca 24 0 
acactgatgg cattcactgc gagaagtgca agaatggctt ttaccggcac agagaaaggg 300 
accgctgttt gccctgcaat tgtaactcca aaggttctct tagtgctcga tgtgacaact 360 
ccggacggtg cagctgtaaa ccaggtgtga caggagccag atgcgaccga tgtctgccag 420 
gcttccacat gctcacggat gcggggtgca cccaagacca gagactgcta gactccaagt 480 
gtgactgtga cccagctggc atcgcagggc cctgtgacgc gggccgctgt gtctgcaagc 54 0 
cagctgtcac tggagaacgc tgtgataggt gtcgatcagg ttactataat ctggatgggg 600 
ggaaccctga gggctgtacc cagtgtttct gctatgggca ttcagccagc tgccgcagct 660 
ctgcagaata cagtgtccat aagatcacct ctacctttca tcaagatgtt gatggctgga 720 
aggctgtcca acgaaatggg tctcctgcaa agctccaatg gtcacagcgc catcaagatg 7 80 
tgtttagctc agcccaacga ctagaccctg tctattttgt ggctcctgcc aaatttcttg 840 
ggaatcaaca ggtgagctat ggtcaaagcc tgtcctttga ctaccgtgtg gacagaggag 900 
gcagacaccc atctgcccat gatgtgattc tggaaggtgc tggtctacgg atcacagctc 960 
ccttgatgcc acttggcaag acactgcctt gtgggctcac caagacttac acattcaggt 1020 
taaatgagca tccaagcaat aattggagcc cccagctgag ttactttgag tatcgaaggt 1080 
tactgcggaa tctcacagcc ctccgcatcc gagctacata tggagaatac agtactgggt 1140 
acattgacaa tgtgaccctg atttcagccc gccctgtctc tggagcccca gcaccctggg 1200 
ttgaacagtg tatatgtcct gttgggtaca aggggcaatt ctgccaggat tgtgcttctg 1260 
gctacaagag agattcagcg agactggggc cttttggcac ctgtattcct tgtaactgtc 1320 
aagggggagg ggcctgtgat ccagacacag gagattgtta ttcaggggat gagaatcctg 1380 
acattgagtg tgctgactgc ccaattggtt tctacaacga tccgcacgac ccccgcagct 14 40 
gcaagccatg tccctgtcat aacgggttca gctgctcagt gatgccggag acggaggagg 1500 
tggtgtgcaa taactgccct cccggggtca ccggtgcccg ctgtgagctc tgtgctgatg 1560 
gctactttgg ggaccccttt ggtgaacatg gcccagtgag gccttgtcag ccctgtcaat 1620 
gcaacaacaa tgtggacccc agtgcctctg ggaattgtga ccggctgaca ggcaggtgtt 1680 
tgaagtgtat ccacaacaca gccggcatct actgcgacca gtgcaaagca ggctacttcg 1740 
gggacccatt ggctcccaac ccagcagaca agtgtcgagc ttgcaactgt aaccccatgg 1800 
gctcagagcc tgtaggatgt cgaagtgatg gcacctgtgt ttgcaagcca ggatttggtg 1860 
gccccaactg tgagcatgga gcattcagct gtccagcttg ctataatcaa gtgaagattc 1920 
agatggatca gtttatgcag cagcttcaga gaatggaggc cctgatttca aaggctcagg 1980 
gtggtgatgg agtagtacct gatacagagc tggaaggcag gatgcagcag gctgagcagg 2040 
cccttcagga cattctgaga gatgcccaga tttcagaagg tgctagcaga tcccttggtc 2100 
tccagttggc caaggtgagg agccaagaga acagctacca gagccgcctg gatgacctca 2160 
agatgactgt ggaaagagtt cgggctctgg gaagtcagta ccagaaccga gttcgggata 2220 
ctcacaggct catcactcag atgcagctga gcctggcaga aagtgaagct tccttgggaa 22 80 
acactaacat tcctgcctca gaccactacg tggggccaaa tggctttaaa agtctggctc 2340 
aggaggccac aagattagca gaaagccacg ttgagtcagc cagtaacatg gagcaactga 2400 
caagggaaac tgaggactat tccaaacaag ccctctcact ggtgcgcaag gccctgcatg 24 60 
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aaggagtcgg aagcggaagc ggtagcccgg acggtgctgt ggtgcaaggg cttgtggaaa 2520 
aattggagaa aaccaagtcc ctggcccagc agttgacaag ggaggccact caagcggaaa 2580 
ttgaagcaga taggtcttat cagcacagtc tccgcctcct ggattcagtg tctcggcttc 2640 
agggagtcag tgatcagtcc tttcaggtgg aagaagcaaa gaggatcaaa caaaaagcgg 2700 
attcactctc aagcctggta accaggcata tggatgagtt caagcgtaca cagaagaatc 2760 
tgggaaactg gaaagaagaa gcacagcagc tcttacagaa tggaaaaagt gggagagaga 2820 
aatcagatca gctgctttcc cgtgccaatc ttgctaaaag cagagcacaa gaagcactga 2880 
gtatgggcaa tgccactttt tatgaagttg agagcatcct taaaaacctc agagagtttg 2940 
acctgcaggt ggacaacaga aaagcagaag ctgaagaagc catgaagaga ctctcctaca 3000 
tcagccagaa ggtttcagat gccagtgaca agacccagca agcagaaaga gccctgggga 3060 
gcgctgctgc tgatgcacag agggcaaaga atggggccgg ggaggccctg gaaatctcca 3120 
gtgagattga acaggagatt gggagtctga acttggaagc caatgtgaca gcagatggag 3180 
ccttggccat ggaaaaggga ctggcctctc tgaagagtga gatgagggaa gtggaaggag 3240 
agctggaaag gaaggagctg gagtttgaca cgaatatgga tgcagtacag atggtgatta 3300 
cagaagccca gaaggttgat accagagcca agaacgctgg ggttacaatc caagacacac 3360 
tcaacacatt agacggcctc ctgcatctga tggaccagcc tctcagtgta gatgaagagg 3420 
ggctggtctt actggagcag aagctttccc gagccaagac ccagatcaac agccaactgc 3480 
ggcccatgat gtcagagctg gaagagaggg cacgtcagca gaggggccac ctccatttgc 3540 
tggagacaag catagatggg attctggctg atgtgaagaa cttggagaac attagggaca 3600 
acctgccccc aggctgctac aatacccagg ctcttgagca acagtgaagc tgccataaat 3660 
atttctcaac tgaggttctt gggatacaga tctcagggct cgggagccat gtcatgtgag 3720 
tgggtgggat ggggacattt gaacatgttt aatgggtatg ctcaggtcaa ctgacctgac 3780 
cccattcctg atcccatggc caggtggttg tcttattgca ccatactcct tgcttcctga 3840 
tgctgggcaa tgaggcagat agcactgggt gtgagaatga tcaaggatct ggaccccaaa 3900 
gaatagactg gatggaaaga caaactgcac aggcagatgt ttgcctcata atagtcgtaa 3960 
gtggagtcct ggaatttgga caagtgctgt tgggatatag tcaacttatt ctttgagtaa 4020 
tgtgactaaa ggaaaaaact ttgactttgc ccaggcatga aattcttcct aatgtcagaa 4080 
cagagtgcaa cccagtcaca ctgtggccag taaaatacta ttgcctcata ttgtcctctg 4140 
caagcttctt gctgatcaga gttcctccta cttacaaccc agggtgtgaa catgttctcc 4200 
attttcaagc tggaagaagt gagcagtgtt ggagtgagga cctgtaaggc aggcccattc 4260 
agagctatgg tgcttgctgg tgcctgccac cttcaagttc tggacctggg catgacatcc 4320 
tttcttttaa tgatgccatg gcaacttaga gattgcattt ttattaaagc atttcctacc 4380 
agcaaagcaa atgttgggaa agtatttact ttttcggttt caaagtgata gaaaagtgtg 444 0 
gcttgggcat tgaaagaggt aaaattctct agatttatta gtcctaattc aatcctactt 4500 
ttagaacacc aaaaatgatg cgcatcaatg tattttatct tattttctca atctcctctc 4560 
tctttcctcc acccataata agagaatgtt cctactcaca cttcagctgg gtcacatcca 4 620 
tccctccatt catccttcca tccatctttc catccattac ctccatccat ccttccaaca 4680 
tatatttatt gagtacctac tgtgtgccag gggctggtgg gacagtggtg acatagtctc 4740 
tgccctcata gagttgattg tctagtgagg aagacaagca tttttaaaaa ataaatttaa 4800 
acttacaaac tttgtttgtc acaagtggtg tttattgcaa taaccgcttg gtttgcaacc 4860 
tctttgctca acagaacata tgttgcaaga ccctcccatg ggggcacttg agttttggca 4920 
aggctgacag agctctgggt tgtgcacatt tctttgcatt ccagctgtca ctctgtgcct 4980 
ttctacaact gattgcaaca gactgttgag ttatgataac accagtggga attgctggag 5040 
gaaccagagg cacttccacc ttggctggga agactatggt gctgccttgc ttctgtattt 5100 
ccttggattt tcctgaaagt gtttttaaat aaagaacaat tgttagaaaa aaaaaa 5156 



<210> 131 
<211> 671 
<212> DNA 
<213> Homo sapiens 

<400> 131 

aggtctggag ggcccacagc cggatgtggg acaccgggaa aaagtggtca tagcacacat 60 

ttttgcatcc cggttgcagt gtgttgcaga cgaagtcctc ttgctcgtca ccccacactt 120 

cctgggcagc caycacgagg atcatgactc ggaaaataaa gatgactgtg atccacacct 18 0 

tcccgatgct ggtggagtgt ttgttgacac ccccgatgaa agtgtgcagc gtcccccaat 240 
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ccattgcgct ggtttatccc tgagtcctgt 

agaatgaggg caagatccct ctgcgagggt 

tgcctagaag ccaatgggtg cacagtgatg 

aggatgtcgc ctggaatatt caaattgaat 

gaatttttct ttcgccatac agaaattgtt 

cctgaccctt cctgctcccc aggaagggag 

cgtgacaccg gagacaggtc ttcttcaccg 

ttaactgcta t 

<210> 132 
<211> 590 
<212> DNA 
<213> Homo sapiens 

<40Q> 132 

ctgaatggaa aagcttatgg ctctgtgatg 

cttggcaatt gcttacccac tgtgctcagc 

ggttcatctg cagccccaaa tctggaggaa 

tttgttccag atatatcaaa ctccaatagc 

ggaactggag acattttcca gcaacatatt 

cctcaccatc aattgaaaaa cacagtgact 

tttctagtta cgtggcaggc cagtggtcct 

cgaaaatact acacaaataa ttttatcacc 

attccaggaa cagctaagcc tgggcactgg 

ctgcaagccc tgaaagtgac agtgacctct 

<210> 133 
<211> 581 
<212> DNA 

<213> Homo sapiens . 
<400> 133 

aggtcctgtc cgggggcact gagaactccc 
actgtgggcc tggagataaa acttgtctcc 
ctgtcctcat ctctgcaaag ttcagcttcc 
atgctctggg gagctcatgg gtggaggagt 
tgggccaggg atgaatattt gagggataaa 
gggggagaac agagaggagc tgggctatgg 
tggctggata tctggtacta aaaaagggtc 
aatccaaacc atagctgtct gtccagtgct 
cctcctagac tctgtccctg ggctagggca 
ggctgaggag agtgtgacat gtggggagag 

<210> 134 

<211> 4797 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 135, 501, 4421, 4467, 4468, 

<223> n = A,T,C or G 

<400> 134 

cctgggacca aagtgctgcc cagagctgag 
ctgtgtacct gtgcagcaca gggtagggtg 
ggagcagcag agacncgcca agcctttact 
ttgtggctac taatttgccc cctgaagatc 



ttccaacgac tgccagtgtt tcagacccaa 300 
ttcagacctc cttctcctac cccactggag 360 
atacgaatgt caatctttgc tcggtcagtg 420 
tacagatgca tgaagagggc gtacaagtta 480 
tagccagatc ttctgtactt cttttccttc 540 
gtcagccccg tttgcaaaac acaggatgcc 600 
acaggaagtg ccttctggtg cctgcacgtt 660 

671 



atattagtga ccagcggaga tgataagctt 60 
agtggttcaa caattcactc cattgccctg 120 
ttatcacgtc ttacaggagg tttaaagttc 180 
atgattgatg ctttcagtag aatttcctct 240 
cagcttgaaa gtacaggtga aaatgtcaaa 300 
gtggataata ctgtgggcaa cgacactatg 360 
cctgagatta tattatttga tcctgatgga 420 
aatctaactt ttcggacagc tagtctttgg 480 
acttacaccc tgaacaatac ccatcattct 540 
cgcgcctcca actcagacct 590 



tctggaattc ttggggggtg ttggggagag 60 
tctaccacca ccctgtaccc tagcctgcac 120 
ttccccaggt ctctgtgcac tctgtcttgg 180 
ctccaccaga gggaggctca ggggactggt 240 
aattgtgtaa gagccaaaga attggtagta 300 
gaaatgattt gaataatgga gctgggaata 360 
tttaagaacc tacttcctaa tctcttcccc 420 
ctcttcctgc ctccagctct gccccaggct 480 
ggggaggagg gagagcaggg ttgggggaga 540 
gaccagacct c 581 



4698 



ggtcctggag ccacatgaga aggcttctcc 60 
agtccactca gctgtctagg agaggaccca 120 
cataccatat tctgatcctt ttccagcaaa 180 
aagatggctc tggggatgac tctgacaact 240 
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tctccggctc aggtgcaggt gaggttgtca 
ggtcatgcct gggggcagtg gtcaggcagt 
caccctgcct gccctgtctc cacccagctg 
ggggtcgtga gctgctgtta aggagagctc 
tccagaaggc cccagggcgc nctgctgcac 
ctcactaaga aacctctgga acccccttca 
tctcatctgc aaaatgggaa taataccttg 
cacagagcca gctggggtgt agctcttcca 
tgtggggact gggggagaga agtccctgag 
aggtggctct tgagtggacc tcaggaagag 
tcatccctgg ggaagtgacc tagcggaggc 
tggaagtgtc tgttgttgga agtgggggcc 
tgtgtgccct gggataagta ggataaccac 
cacccctgtg gtcacagaaa agctttccca 
gagacaggct gcacctgaca cacacaatgg 
gagcttagcc tcagctgcct tgtccaggta 
gcccaggtgc tctggagcct cccccgaccc 
accccccacc tccccaacac , actctgcttc 
cttgtcacag cagaccccct ccacttggaa 
gtctccagaa cccaccggcc tggaggctac 
agaggggccc aaggagggag aggctgtagt 
ccgggagcag gaggccaccc cccgacccag 
ggcctcaacg accacagcca ccacggccca 
catgcagcct ggccaccatg agacctcaac 
cactccccac acagaggatg gaggtccttc 
ctccagtcag ctcccagcag cagagggctc 
ttgggaaatt gagtgggttg gtcctaatgc 
ctgcgcgatc tcgtattcct caccaggaag 
ccagggcctc gcagagcagg acagactaac 
atcacccaag agagggctcc caaactcaca 
aacgttatac cagtcatttt atttatagct 
gctattcata caaaatgtgt gctttgtatc 
ccagggtccg gagttgatgt ggcaagaagg 
ttgggtgcat ctgagtgggt ggtggcaaag 
gtagtggagc tggttgctgc tgctggcggt 
cccacaggac ttcacctttg aaacctcggg 
tgaccgccgg aaccagtccc cagtggatca 
ggacaggaaa gaggtgctgg gaggtgagtt 
gctgctgtgg ggtcagggtg gggctgacca 
cacgagagcc caaggagccg ctgagctgag 
tgccggaggc ctcgtggggc tcatctttgc 
catgaagaag aaggacgaag gcagctactc 
ggcctaccag aagcccacca aacaggagga 
ccctccgccc tgccactcac taggccccca 
tggcctcccc tgccaccagg ccacctcccc 
cacggagtcg tgggtgtgct gggagctcca 
tagggcacca ggggtttctc gcataggacc 
ccattctgac tcggtttctc caaactgaag 
gagggggatc cgactgcttt ggacctaaat 
ggggcttggg gctcacacac ctgtagcact 
ggccgctgag tggcagggga caggagtcac 
tcgacttgtt tttgcacatg tttcctctag 
acttctgagg taagttaagt aagttgattc 
tggtcgggag acagcatcag ggttaagaag 
ccaaatctgg aagccaaaat gtaggcttag 
atgtgtgcaa cagggtatgg actatctgtc 
ggctggccag tccaggctgc cgtggggccg 
catgcgctca gggccatgct gaggcctggg 



tgggggcccc ccccacccaa gacggcaaca 300 
ctcctgtgtt tactgagcat gtactgagtg 360 
gctccaaagg' gcaatgctga ggagaggaat 420 
atgcttggag gtgaggtgaa ggctgtgagc 480 
gcaggctcat attcactagg aatagcttta 540 
gaaggttatt tgactcctga gcctctattt 600 
acctgataag cttgtggagc tgtaaggcag 660 
tccaagctcc cttccttact tcccctttcc 720 
ctggaggtgg tcagggaagc ttcacagagg 780 
gggtgagaga gctaaggaag gaggctgagg 840 
ctgagagctg caaggtagga tatctgttgt 900 
tttttttcag ggagggtggg gccagagaag 960 
agtagttatg cccctaaggg atgcccaccc 1020 
ggtggcctag gcacctgtct cgtggctcca 1080 
aaggacagct ctccttgtcc attttccaag 1140 
ctagcctccc tcatagcctg agcttggcca 1200 
acccaacaca ctctgcttct ggtcctcccc 1260 
tggtcctgca ggtgctttgc aagatatcac 1320 
ggacacgcag ctcctgacgg ctattcccac 1380 
agctgcctcc acctccaccc tgccggctgg 1440 
cctgccagaa gtggagcctg gcctcaccgc 1500 
ggagaccaca cagctcccga ccactcatca 1560 
ggagcccgcc acctcccacc cccacaggga 1620 
ccctgcagga cccagccaag ctgaccttca 1680 
tgccaccgag agggctgctg aggatggagc 1740 
tggggagcag gtgagtggcc tctgcattcc 1800 
ctggcacttg gcaggcccta cacctgtgcc 1860 
acagggcaca ggggccgcct tcccctaccc 1920 
tatgagatca gagcagaagc acccttaaag 1980 
atccaaactt gcagccctcg tcgaagagtg 2040 
tcgtggattt acgcttacac taaatagtct 2100 
actttttgtg atatccatgc catggtccag 2160 
cctggctttc gggccctgtg cgatcctggt 2220 
atcagggagg caggagctgc ttctgggtct 2280 
gacctggcca acccaatctg cccctgccct 2340 
ggagaatacg gctgtagtgg ccgtggagcc 2400 
gggggccacg ggggcctcac agggcctcct 24 60 
ttctttcagg ggggtagttt ggggtgaatt 2520 
cagccaaggc cactgctttg ggagggtctg 2580 
ctggccccgt ctacctgccc taggggtcat 2640 
tgtgtgcctg gtgggtttca tgctgtaccg 2700 
cttggaggag ccgaaacaag ccaacggcgg 2760 
attctatgcc tgacgcggga gccatgcgcc 2820 
cttgcctctt ccttgaagaa ctgcaggccc 2880 
agcattccag cccctctggt cgctcctgcc 2940 
ctctgcttct ctgacttctg cctggagact 3000 
tttccaccac agccagcacc tggcatcgca 3060 
cagcctctcc ccaggtccag ctctggaggg 3120 
ggcctcatgt ggctggaaga tcctgcgggt 3180 
tactggtagg accaagcatc ttgggggggt 3240 
tttgtttcgt ggggaggtct aatctagata 3300 
ttctttgttc atagcccagt agaccttgtt 3360 
ggtatccccc catcttgctt ccctaatcta 3420 
actttttttt ttttttttaa actaggagaa 3480 
tttgtgtgtt gtctcttgag tttgtcgctc 3540 
tggtggcccc gttctggtgg tctgttggca 3600 
ccgcctcttt caagcagtcg tgcctgtgtc 3660 
ccgctgccac gttggagaag cccgtgtgag 3720 
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aagtgaatgc tgggactcag ccttcagaca 
tggagatcct cctgcaggct cacgcccgtc 
cctcctggaa attgacgagg ggtgtcttgg 
ggccaggttc tccgttagct cctgtggccc 
attttccaaa gagtgatagt cttttgcttt 
ttccctgtac agtagatttt ccaaatgtaa 
ccactgcctt cgcttcttgc ctctgtgctg 
accaacatgt tgggaaactt ggctcgaatc 
attctggttc cagggtccct ctgtgtattt 
aggtcggtag gttcagccaa ggttttataa 
ccaagcccat cttctaaatg gcaaaggaag 
gctgtggtca cagcggaggt gtggagccga 
cctcccaccc ggctgcagag gccaganncc 
tgacaacgtt tcagcgactc cgttggccac 
atgcaccacc aagccaaggg aacctgtgtc 
agtgtatgac tgcacatgac tcgggggtgg 
ctggtccgtg ggacggtncc caagccagag 
gtacttgtca tttcgggcaa cggctgctgt 

<210> 135 

<211> 2856 

<212> DNA 

<213> Homo sapiens 

<400> 135 

tagtcgcggg tccccgagtg agcacgccag 
gtcagagtcg cagtgggagt ccccggaccg 
cgcacgcccg tcgccacccg cgtacccggc 
tggagcccag cagcaagaag ctgacgggtc 
ttggctccct gcagtttggc tacaacactg 
aggagttcta caaccagaca tgggtccacc 
tcaccacgct ctggtccctc tcagtggcca 
tctctgtggig ccttttcgtt aaccgctttg 
tgctggcctt cgtgtccgcc gtgctcatgg 
tgdtgatcct gggccgcttc atcatcggtg 
ccatgtatgt gggtgaagtg tcacccacag 
agctgggcat cgtcgtcggc atcctcatcg 
gcaacaagga cctgtggccc ctgctgctga 
gcatcgtgct gcccttctgc cccgagagtc 
agaaccgggc caagagtgtg ctaaagaagc 
tgcaggagat gaaggaagag agtcggcaga 
agctgttccg ctcccccgcc taccgccagc 
cccagcagct gtctggcatc aacgctgtct 
cgggggtgca gcagcctgtg tatgccacca 
ctgtcgtgtc gctgtttgtg gtggagcgag 
tcgctggcat ggcgggttgt gccatactca 
taccctggat gtcctatctg agcatcgtgg 
tgggtcctgg ccccatccca tggttcatcg 
cagctgccat tgccgttgca ggcttctcca 
gcttccagta tgtggagcaa ctgtgtggtc 
tggttctgtt cttcatcttc acctacttca 
atgagatcgc ttccggcttc cggcaggggg 
agctgttcca tcccctgggg gctgattccc 
gcctgctccc agcagcccta aggatctctc 
aacctgacag atgtcagccg agccgggcct 
ccagaagaat attcaggact taacggctcc 
aaatctattc agacaagcaa caggttttat 
atatcagcct gagtctcctg tgcccacatc 



gagaggactg tagggagggc ggcaggggcc 3780 
ctcctgtggc gccgtctcca ggggctgctt 3840 
gcagagctgg ctctgagcgc ctccatccaa 3900 
caccctgggc cctgggctgg aatcaggaat 3960 
tggcaaaact ctacttaatc caatgggttt 4020 
taaactttaa tataaagtag tctgtgaatg 4080 
tgtgtgacgt gaccggactt ttctgcaaac 4140 
tctgtgcctt cgtctttccc atggggaggg 4200 
gcttttttgt tttggctgaa attctcctgg 4260 
ggctgatgtc aatttctgtg ttgccaagct 4320 
gtggatggcc ccagcacagc ttgacctgag 4380 
ggcctacccc ncagacacct tggacatcct 4 440 
agcccagggt cctgcactta cttgcttatt 4500 
tccgagagtg ggccagtctg tggatcagag 4560 
cggtattcga tactgcgact ttctgcctgg 4620 
ggaaaggggt cggctgacca tgctcatctg 4680 
gtgggttcat ttgtgtaacg acaataaacg 4740 
ggtggtggtt gagtctcttc ttggcct 4797 



ggagcaggag accaaacgac gggggtcgga 60 

gagcacgagc ctgagcggga gagcgccgct 120 

gcagccagag ccaccagcgc agcgctgcca 180 

gcctcatgct ggctgtggga ggagcagtgc 240 

gagtcatcaa tgccccccag aaggtgatcg 300 

gctatgggga gagcatcctg cccaccacgc 360 

tcttttctgt tgggggcatg attggctcct 42 0 

gccggcggaa ttcaatgctg atgatgaacc 480 

gcttctcgaa actgggcaag tcctttgaga 540 

tgtactgcgg cctgaccaca ggcttcgtgc 600 

cctttcgtgg ggccctgggc accctgcacc 660 

cccaggtgtt cggcctggac tccatcatgg 720 

gcatcatctt catcccggcc ctgctgcagt 780 

cccgcttcct gctcatcaac cgcaacgagg 840 

tgcgcgggac agctgacgtg acccatgacc 900 

tgatgcggga gaagaaggtc accatcctgg 960 

ccatcctcat cgctgtggtg ctgcagctgt 1020 

tctattactc cacgagcatc ttcgagaagg 1080 

ttggctccgg tatcgtcaac acggccttca 1140 

caggccggcg gaccctgcac ctcataggcc 1200 

tgaccatcgc gctagcactg ctggagcagc 1260 

ccatctttgg ctttgtggcc ttctttgaag 1320 

tggctgaact cttcagccag ggtccacgtc 1380 

actggacctc aaatttcatt gtgggcatgt 1440 

cctacgtctt catcatcttc actgtgctcc 1500 

aagttcctga gactaaaggc cggaccttcg 1560 

gagccagcca aagtgataag acacccgagg 1620 

aagtgtgagt cgccccagat caccagcccg 1680 

aggagcacag gcagctggat gagacttcca 1740 

ggggctcctt tctccagcca gcaatgatgt 1800 

aggattttaa caaaagcaag actgttgctc 1860 

aattttttta ttactgattt tgttattttt 1920 

ccaggcttca ccctgaatgg ttccatgcct 1980 
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gagggtggag 
ctggacctat 
gaggtggcta 
cattaggatt 
cctgagacca 
gccgggttct 
gggagcctgc 
tgcaagatat 
atatctggac 
tataaatggc 
tttggatggg 
gactcaggat 
tttgatccct 
atcacatatt 
aggcttgaaa 



actaagccct 
gtcctaagga 
tggccacccg 
tgccccttcc 
gttgggagca 
agtctccttt 
aaactcactg 
ttatatatat 
aagccaactt 
tggtttttag 
agtgagacag 
ccagtccctt 
gttacccaga 
tgatagttgg 
tcgcattatt 



gtcgagacac 
cacactaatc 
ttctgctggc 
catctcttcc 
ctggagtgca 
gcactgaggg 
ctcaagaaga 
ttttggttgt 
gtaaatacac 
aaacatggtt 
aagtaagtgg 
acacgtacct 
gaatatatac 
tgttcaaaaa 
ttgaatgtga 



ttgccttctt 
gaactatgaa 
ctggatctcc 
tacccaacca 
gggaggagag 
ccacactatt 
catggagact 
caatattaaa 
cacctcactc 
ttgaaatgct 
ggttgcaacc 
ctcatcagtg 
attctttatc 
aacactagtt 
agggaa 



cacccagcta 
ctacaaagct 
ccactctagg 
ctcaaattaa 
gggaagggcc 
accatgagaa 
cctgccctgt 
tacagacact 
ctgttactta 
tgtggattga 
actgcaacgg 
tcctcttgct 
ttgacattca 
ttgtgccagc 



atctgtaggg 
tctatcccag 
ggtcaggctc 
tctttcttta 
agtctgggct 
gagggcctgt 
tgtgtataga 
aagttatagt 
cctaaacaga 
gggtaggagg 
cttagacttc 
caaaaatctg 
aggcatttct 
cgtgatgctc 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2856 



<210> 136 
<211> 356 
<212> DNA 
<213> Homo sapiens 



<400> 136 

ggtggagcca 

aggcattgat 

tgaccacaca 

agtgctactt 

tgaaggaaac 

agaagagacc 



aatgaagaaa 
gatgatgaag 
aaacagaacc 
cagacaacca 
tggaacccag 
ccacattcta 



atgaagatga 
attttatctc 
aggactggac 
caaggatgac 
aagcacaccc 
caagcacaat 



aagagacaga 
cagcaccatt 
tcagtggaac 
tgatgtagac 
tcccctcatt 
ccaggcaact 



cacctcagtt 
tcaaccacac 
ccaagccatt 
agaaatggca 
caccatgagc 
cctagtagta 



tttctggatc 60 
cacgggcttt 120 
caaatccgga 180 
ccactgctta 240 
atcatgagga 300 
caacgg 356 



<210> 137 
<211> 356 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 254, 264, 279, 281, 290, 328, 342 
<223> n = A,T,C or G 



<400> 137 

gcaggtggag 

gtcactggct 

ctatgtctcc 

cctttttctc 

cgctaaggcc 

ggacccaaac 



aagacatttt 
gcccccggaa 
cagcaaggac 
aaagacatcg 
aaanttccag 
tctggcgcgg 



attgttcctg 
cagggcgctg 
agaaactcag 
gcgaggtaat 
acanayggcc 
aaacacangg 



gggtctctgg 
ctccatggct 
aaaaatcaat 
ttgtgccctt 
gggccggtnc 
gcataagctt 



aggcccattg 
ctgcttgtgg 
cttcttatcc 
tttacctcgg 
nataggggan 
gnttcctgtg 



gtggggctgg 60 
tagtctgtgg 120 
tcattcttgt 180 
cccgcgacca 24 0 
cccaacttgg 300 
gggaaa 356 



<210> 138 
<211> 353 
<212> DNA 
<213> Homo 



sapiens 



<400> 138 

aggtccagtc 

aatagacact 

tacattgatg 

aatgccagcc 

ctccagccgt 



ctccacttgg 
tagatttctc 
tggaaattgc 
atcttggcat 
gtctttatgt 



cctgatgaga 
tcttgtggga 
tgctgctacc 
cctggccctc 
caagcagcat 



gtggggagtg 
agaaaccacc 
accacctcct 
gagcaggctg 
cttgtactcc 



gcaagggacg 
tgtccatcca 
gaagaggctt 
cggtaagtag 
tggttctgag 



tttctcctgc 60 
ctgactcttc 120 
ccctgatgcc 180 
cgatctcctg 240 
cctccatctc 300 
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gcatcggagc 


tcactcagac 


ctcgsccgsg 


mssmcgctam 


gccgaattcc 


age 


353 


<210> 139 














<211> 371 














<212> DNA 














<213> Homo 


sapiens 












<400> 139 














agcgtggtcg 


cggccgaggt 


ccatccgaag 


caagattgca 


gatggcagtg 


tgaagagaga 


60 


agacatattc 


tacacttcaa 


agctttggtg 


caattcccat 


cgaccagagt 


tggtccgacc 


120 


agccttggaa 


aggtcactga 


aaaatcttca 


attggattat 


gttgacctct 


accttattca 


180 


ttttccagtg 


tctgtaaagc 


caggtgagga 


agtgatccca 


aaagatgaaa 


atggaaaaat 


O A fi 
£. *A U 


actatttgac 


acagtggatc 


tctgtgccac 


gtgggaggcc 


gtggagaagt 


gtaaagatgc 




aggattggac 


ctgcccgggc 


ggccgctcga 


aagccgaatt 


ccagcacact 


ggcggccgtt 




actagtggat 


c 










"3 "71 


<210> 140 














<211> 370 














<212> DNA 














<213> Homo 


sapiens 












<400> 140 














tagcgtggtc 


gcggccgagg 


tccatctccc 


tttgggaact 


agggggctgc 


tggt^ggaaa 


60 


tgggagccag 


ggcagatgtt 


gcattccttt 


gtgtccctgt 


aaatgtggga 


ctacaagaag 


120 


aggagctgcc 


tgagtggtac 


tttctcttcc 


tggtaatcct 


ctggcccagc 


ctcatggcag 


180 


aatagaggta 


tttttaggct 


atttttgtaa 


tatggcttct 


ggtcaaaatc 


cctgtgtagc 


240 


tgaattccca 


agccctgcat 


tgtacagccc 


cccactcccc 


tcaccaccta 


ataaaggaat 


300 


agttaacact 


caaaaaaaaa 


aaaaaacctg 


cccgggcggc 


cgctcgaaag 


ccgaattcca 




gcacactggc 












O *7 A 


<210> 141 














<211> 371 














<212> DNA 














<213> Homo 


sapiens 












<400> 141 














tagcgtggtc 


gcggccgagg 


tcctctgtgc 


tgcctgtcac 


agcccgatgg 


taccagcgca 


60 


gggtgtaggc 


agtgcaggag 


ccctcatcca 


gtggcaggga 


acaggggtca 


tcactatccc 


120 


aaggagcttc 


agggtcctgg 


tactcctcca 


cagaatactc 


ggagtattca 


gagtactcat 


180 


catcctcagg 


gggtacccgc 


tcttcctcct 


ctgcatgaga 


gacgcggagc 


acaggcacag 


o a n 


catggagctg 


ggagccggca 


gtgtctgcag 


cataactagg 


gaggggtcgt 


gatccagatg 


oUU 


cgatgaactg 


gccctggcag 


gcacagtgct 


gactcatctc 


ttggcgacct 


gcccgggcgg 


JOU 


ccgctcgaag 


c 












<210> 142 














<211> 343 














<212> DNA 














<213> Homo 


sapiens 












<400> 142 














gcgttttgag 


gccaatggtg 


taaaaggaaa 


tatcttcaca 


taaaaactag 


atggaagcat 


60 


tgtcagaaac 


ctctttgtga 


tgtttgcttt 


caactcacag 


agttgaacat 


tccttttcat 


120 


agagcagttt 


tgaaacactc 


ttttgtagaa 


tttgcaagcg 


gatgattgga 


tegctatgag 


180 


gtcttcattg 


gaaacgggat 


acctttacat 


aaaaactaga 


cagtagcatt 


ctcagaaatt 


240 


tctttgggat 


gtgggcattc 


aacccacaga 


ggagaacttc 


atttgataga 


gcagttttga 


300 


aacacccttt 


ttgtagaatc 


tacaggtgga 


catttagagt 


get 




343 



<210> 143 
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<211> 354 
<212> DNA 
<213> Homo sapiens 



<400> 143 

aggtctgatg 

catcaggagt 

gtggtggagt 

aaattccatc 

agcaaatctc 

cataaacatt 



gcagaaaaac 
gggatgggaa 
gtgtcatgaa 
atcactttgg 
catactgttt 
ttacatgcag 



tcagactgtc 
ggaaagcaca 
caatgtcacc 
acaggagtta 
ctttcttttt 
ctatttcaaa 



tgcaacttta 
ataacaagaa 
tgtactcgga 
attaagagaa 
tttttcatta 
gtgtgttgga 



cagatggtgc 
aattgaaaga 
tctatgaaaa 
tgaccaagct 
ctgtgttcaa 
ttaattagga 



attggttcag 60 
tgggaaatta 120 
agtagaataa 180 
cagttcaatg 240 
ttatctttat 300 
teat 354 



<210> 144 
<211> 353 
<212> DNA 
<213> Homo sapiens 



<400> 144 

ggtcaaggac 

cctagagcac 

aagatgacag 

gttaagttgc 

gaaaccatgc 

aggtttgect 



ctgggggacc 
atctggatct 
actaagtagg 
ttaactttca 
cccagagaag 
gataccagac 



cccaggtcca 
cagccccacc 
attctgecat 
ttctgtctta 
gttaagtgac 
ctgtggcccc 



gcagccacat 
cctggcaacc 
ttagaataat 
cgatagtctt 
ttcctcttta 
acctcccatg 



gattctgeag 
tgcctgccta 
tctggtatcc 
cagaggtggg 
tggagccagt 
caggtctctg 



cagacaggga 60 
gagaactccc 120 
tgggcgttgc 180 
aacagatgaa 240 
gttccaacct 300 
tgg . 353 



<210> 145 
<211> 371 
<212> DNA 
<213> Homo sapiens 



<400> 145 

caggtctgtc 

ttcctgagac 

attgecactg 

aatgeaaact 

tggaatttgg 

atgtgagacc 

tagtggatcc 



ataaactggt 
ttgctggcct 
ttgatcacta 
gcaagaatca 
ggtgtcctta 
tcggccgcga 

g 



ctggagtttc 
etcegttgag 
gctttttctt 
aagecaagge 
taggaccaga 
ccacgctaag 



tgacgactcc 
tccacttggc 
ctgcccacac 
caagagggat 
ggttgtgttt 
ccgaattcca 



ttgttcacca 
tttctgtcct 
cttcttcgac 
gecaagatga 
gctccacctt 
gcacactggc 



aatgeaccat 60 
ccacagctcc 120 
tgttgactgc 180 
tcagccattc 240 
cttgactccc 300 
ggcccgttac 360 
371 



<210> 146 
<211> 355 
<212> DNA 
<213> Homo sapiens 



<400> 146 

ggtcctccgt 

caggatggcg 

ggtacggaag 

eggegaggaa 

cgagagcaag 

tggtcaggaa 



cctcttccca 
agtagcagcg 
ategggtctg 
gtggcagtga 
ctctataaga 
aaagactaca 



gaggtgtcgg 
gctccaaggc 
gctccttcgg 
agctagaatc 
ttcttcaagg 
atgtactagt 



ggcttggccc 
tgaattcatt 
ggacatctat 
teagaaggee 
tggggttggc 
catggatctt 



cagcctccat 
gtcggaggga 
ttggcgatca 
aggcatcccc 
atcccccaca 
ctgggaccta 



cttcgtctct 60 
aatataaact 120 
acatcaccaa 180 
agttgctgta 240 
tacggtggta 300 
geetc 355 



<210> 147 
<211> 355 
<212> DNA 
<213> Homo sapiens 



<400> 147 
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ggtctgttac aaaatgaaga cagacaacac 

tactatgcac gtgctgtgat tttgaacata 

tgacttttta ggttggctga tccatcaatc 

ttgttaggag caaagctgac ctgaacagca 

tttttcccat aatatgggaa atattttaag 

acatttggta tatcttcatt ctttgaaaca 

<210> 148 
<211> 369 
<212> DNA 
<213> Homo sapiens 



aacatttact ctgtggagat atcctactca 60 
actcgtccca aaaacttgtc acgatcatcc 120 
ttgcactcaa ctgttacttc tttcccagtg 180 
accaatggct gtagataccc aacatgcagt 240 
tctatcattc cattatgagg ataaactgct 300 
caatctatcc ttggcactcc ttcag 355 



<400> 148 

aggtctctct ccccctctcc ctctcctgcc 
caccttcctt catgatgtgg gaagagtgct 
agggagtgtg ccgagggctt ctgagaaggt 
atgtggcagc ccctcttctt caagtggctc 
gctgcagcag cctccatcca gcctgaggat 
gaaaagatga gagaagttac agactctcct 
acttcttca 

<210> 149 
<211> 620 
<212> DNA 
<213> Homo sapiens 



agccaagtga agacatgctt acttcccctt 60 
gcaacccagc cctagccaac accgcatgag 120 
ttctctcaca tctagaaaga agcgcttaag 180 
ttgtcctgtt gccctgggag ttctcaaatt 240 
gacatcaata cacagaggaa gaagagtcag 300 
gggcgacccc gagagcttac cattcctcag 360 

369 



<220> 

<221> misc_feature 

<222> 169, 171, 222, 472, 528, 559, 599 
<223> n = A,T,C or G 



<400> 149 

actagtcaaa aatgctaaaa taatttggga gaaaatattt tttaagtagt gttatagttt 60 

catgtttatc ttttattatg ttttgtgaag ttgtgtcttt tcactaatta cctatactat 120 

gccaatattt ccttatatct atccataaca tttatactac atttgtaana naatatgcac 180 

gtgaaactta acactttata aggtaaaaat gaggtttcca anatttaata atctgatcaa 240 

gttcttgtta tttccaaata gaatggactt ggtctgttaa gggctaagga gaagaggaag 300 

ataaggttaa aagttgttaa tgaccaaaca ttctaaaaga aatgcaaaaa aaaagtttat 360 

tttcaagcct tcgaactatt taaggaaagc aaaatcattt cctaaatgca tatcatttgt 420 

gagaatttct cattaatatc ctgaatcatt catttcacta aggctcatgt tnactccgat 480 

atgtctctaa gaaagtacta tttcatggtc caaacctggt tgccatantt gggtaaaggc 540 

tttcccttaa gtgtgaaant atttaaaatg aaattttcct ctttttaaaa attctttana 600 
agggttaagg gtgttgggga 620 

<210> 150 
<211> 371 
<212> DNA 
<213> Homo sapiens 

<400> 150 

ggtccgatca aaacctgcta cctccccaag actttactag tgccgataaa ctttctcaaa 60 

gagcaaccag tatcacttcc ctgtttataa aacctctaac catctctttg ttctttgaac 120 

atgctgaaaa ccacctggtc tgcatgtatg cccgaatttg yaattctttt ctctcaaatg 180 

aaaatttaat tttagggatt catttctata ttttcacata tgtagtatta ttatttcctt 240 

atatgtgtaa ggtgaaattt atggtatttg agtgtgcaag aaaatatatt tttaaagctt 300 

tcatttttcc cccagtgaat gatttagaat tttttatgta aatatacaga atgttttttc 360 

ttacttttat a 371 
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<210> 151 

<211> 4655 

<212> DNA 

<213> Homo sapiens 

<400> 151 

gggacttgag ttctgttatc ttcttaagta 
gggttggcaa aatcctggag ccagaagaaa 
acatgttgta cctggaaaac aatgcccaga 
tggggctcct gaacagcatg gaccagcaga 
ataacacaga ccacgcgcag aacagcgtca 
ccaccttcga tgctctctct ccatcacccg 
cgcacagttt cgacgtgtcc ttccagcagt 
attccactga actgaagaaa ctctactgcc 
aggtgatgac cccacctcct cagggagctg 
ctgagcacgt cacggaggtg gtgaagcggt 
acgagggaca gattgcccct yctagtcatt 
agtatgtaga agatcccatc acaggaagac 
aggttggcac tgaattcacg acagtcttgt 
gagggatgaa ccgccgtcca attttaatca 
tcctgggccg acgctgcttt gaggcccgga 
cggatgaaga tagcatcaga aagcagcaag 
cgaagcgccc gtttcgtcag aacacacatg 
gatccccaga tgatgaactg gtatacttac 
tggtgaagat caaagagtcc ctggaactca 
cgtacaggca acagcaacag cagcagcacc 
agtctccatc ttcatatggt aacagctccc 
agctgccttc tgtgagccag cttatcaacc 
ccattcctga tggcatggga gccaacattc 
gagacatgaa tggactcagc cccacccagg 
cctcccactg cacaccccca cctccgtatc 
cgaggttggg ctgttcatca tgtctggact 
atcagattga gcattactcc atggatgatc 
gacatgcgat ctggaagggc atcctggacc 
ctcatctcct gcggacccca agcagtgcct 
ggggtgagcg tgttattgat gctgtgcgat 
cccgagatga gtggaatgac ttcaactttg 
gcatcaaaga ggagggggag tgagcctcac 
tgccagcccc ctaaaagcac tcctgcttaa 
ttcctcttgt ctgatttctt aggggaagga 
atctgacctg gcatctaatt ctgattctgg 
actgtagctt gccatggcta ggtagaagtg 
tgcagagatt tctcattgac ttttataaag 
atataaatgt ataaatatac agtatagatt 
aatgtaattt aaatgaaaga aaattgagtt 
ttttggatgg cttgtctata ctccttccct 
tagagcttaa tgctacatgt gagtgacgat 
ctaaatacat gccacatcaa acctttgagt 
aagactgtag atatgtattc ttttctcagt 
tctagtgatg atggttcacg ttggggtgat 
caaacgtcct ctttagtttt tggttgggaa 
ccagttcaaa aacacccgac gtcatgtatt 
taccagatac cttatcttac aatattgatt 
ttaaaactaa atttcactac tagattgact 
gaattctgat tgatttgatt gggatgaatg 
actgtctatt aatattcagg gtaaatagga 
cagtaagata tctcaatgaa ccataaattc 
aatattgttt ggtaaatgtt tcttttgttt 



gattcatatt gtaagggtct cggggtgggg 60 
ggacagcagc attgatcaat cttacagcta 120 
ctcaatttag tgagccacag tacacgaacc 180 
ttcagaacgg ctcctcgtcc accagtccct 240 
cggcgccctc gccctacgca cagcccagct 300 
ccatcccctc caacaccgac tacccaggcc 3 60 
cgagcaccgc caagtcggcc acctggacgt 420 
aaattgcaaa gacatgcccc atccagatca 480 
ttatccgcgc catgcctgtc tacaaaaaag 540 
gccccaacca tgagctgagc cgtgaattca 600 
tgattcgagt agaggggaac agccatgccc 660 
agagtgtgct ggtaccttat gagccacccc 720 
acaatttcat gtgtaacagc agttgtgttg 780 
ttgttactct ggaaaccaga gatgggcaag 84 0 
tctgtgcttg cccaggaaga gacaggaagg 900 
tttcggacag tacaaagaac ggtgatggta 960 
gtatccagat gacatccatc aagaaacgaa 1020 
cagtgagggg ccgtgagact tatgaaatgc 1080 
tgcagtacct tcttcagcac acaattgaaa 1140 
agcacttact tcagaaacag acctcaatac 1200 
cacctctgaa caaaatgaac agcatgaaca 12 60 
ctcagcagcg caacgccctc actcctacaa 1320 
ccatgatggg cacccacatg ccaatggctg 1380 
cactccctcc cccactctcc atgccatcca 14 4 0 
ccacagattg cagcattgtc agtttcttag 1500 
atttcacgac ccaggggctg accaccatct 15 60 
tggcaagtct gaaaatccct gagcaatttc 1620 
accggcagct ccacgaattc tcctcccctt 1680 
ctacagtcag tgtgggctcc agtgagaccc 17 4 0 
tcaccctccg ccagaccatc tctttcccac 18 00 
acatggatgc tcgccgcaat aagcaacagc 18 60 
catgtgagct cttcctatcc ctctcctaac 1920 
tcttcaaagc cttctcccta gctcctcccc 1980 
gaagtaagag gcttacttct taccctaacc 2040 
ctttaagcct tcaaaactat agcttgcaga 2100 
agcaaaaaag agttgggtgt ctccttaagc 2160 
catgttcacc cttatagtct aagactatat 2220 
tttgggtggg gggcattgag tattgtttaa 2280 
gcacttattg accatttttt aatttacttg 2340 
taaggggtat catgtatggt gataggtatc 2400 
gatgtacaga ttctttcagt tctttggatt 24 60 
agatccattt ccattgctta ttatgtaggt 2520 
gttggtatat tttatattac tgacatttct 2580 
ttaatccagt tataagaaga agttcatgtc 2640 
tgaggaaaat tcttaaaagg cccatagcag 2700 
tgagcatatc agtaaccccc ttaaatttaa 27 60 
gggaaaacat ttgctgccat tacagaggta 2820 
aactcaaata cacatttgct actgttgtaa 2880 
ccatctatct agttctaaca gtgaagtttt 2940 
atcattcaga aatgttgagt ctgtactaaa 3000 
aactttgtaa aaatcttttg aagcatagat 3060 
ggtaaatgtt tcytttaaag accctcctat 3120 
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tctataaaac tctgcatgta gaggcttgtt tacctttctc tctctaaggt ttacaatagg 3180 
agtggtgatt tgaaaaatat aaaattatga gattggtttt cctgtggcat aaattgcatc 3240 
actgtatcat tttctttttt aaccggtaag agtttcagtt tgttggaaag taactgtgag 3300 
aacccagttt cccgtccatc tcccttaggg actacccata gacatgaaag gtccccacag 3360 
agcaagagat aagtctttca tggctgctgt tgcttaaacc acttaaacga agagttccct 3420 
tgaaactttg ggaaaacatg ttaatgacaa tattccagat ctttcagaaa tataacacat 3480 
ttttttgcat gcatgcaaat gagctctgaa atcttcccat gcattctggt caagggctgt 3540 
cattgcacat aagcttccat tttaatttta aagtgcaaaa gggccagcgt ggctctaaaa 3600 
ggtaatgtgt ggattgcctc tgaaaagtgt gtatatattt tgtgtgaaat tgcatacttt 3660 
gtattttgat tatttttttt ttcttcttgg gatagtggga tttccagaac cacacttgaa 3720 
accttttttt atcgtttttg tattttcatg aaaataccat ttagtaagaa taccacatca 3780 
aataagaaat aatgctacaa ttttaagagg ggagggaagg gaaagttttt ttttttatta 3840 
tttttttaaa attttgtatg ttaaagagaa tgagtccttg atttcaaagt tttgttgtac 3900 
ttaaatggta ataagcactg taaacttctg caacaagcat gcagctttgc aaacccatta 3960 
aggggaagaa tgaaagctgt tccttggtcc tagtaagaag acaaactgct tcccttactt 4020 
tgctgagggt ttgaataaac ctaggacttc cgagctatgt cagtactatt caggtaacac 4080 
tagggccttg gaaatccctg tactgtgtct catggatttg gcactagcca aagcgaggca 4140 
ccccttactg gcttacctcc tcatggcagc ctactctcct tgagtgtatg agtagccagg 4200 
gtaaggggta aaaggatagt aagcatagaa accactagaa agtgggctta atggagttct 4260 
tgtggcctca gctcaatgca gttagctgaa gaattgaaaa gtttttgttt ggagacgttt 4320 
ataaacagaa atggaaagca' gagttttcat taaatccttt tacctttttt ttttcttggt 4380 
aatcccctaa aataacagta tgtgggatat tgaatgttaa agggatattt ttttctatta 4440 
tttttataat tgtacaaaat taagcaaatg ttaaaagttt tatatgcttt attaatgttt 4500 
tcaaaaggta ttatacatgt gatacatttt ttaagcttca gttgcttgtc ttctggtact 4560 
ttctgttatg ggcttttggg gagccagaag ccaatctaca atctcttttt gtttgccagg 4620 
acatgcaata aaatttaaaa aataaataaa aacta 4655 

<210> 152 
<211> 586 - 
<212> PRT 

<213> Homo sapiens 
<400> 152 



Met 


Leu 


Tyr 


Leu 


Glu 


Asn 


Asn 


Ala 


Gin 


Thr Gin Phe Ser Glu Pro Gin 


1 






5 










10 15 


Tyr 


Thr 


Asn 


Leu 


Gly 


Leu 


Leu 


Asn 


Ser 


Met Asp Gin Gin He Gin Asn 






20 










25 


30 • 


Gly 


Ser 


Ser 


Ser 


Thr 


Ser 


Pro 


Tyr 


Asn 


Thr Asp His Ala Gin Asn Ser 




35 










40 




45 


Val 


Thr 


Ala 


Pro 


Ser 


Pro 


Tyr 


Ala 


Gin 


Pro Ser Ser Thr Phe Asp Ala 




50 










55 






60 


Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 


Asn Thr Asp Tyr Pro Gly Pro 


65 










70 








75 80 


His 


Ser 


Phe 


Asp 


Val 


Ser 


Phe 


Gin 


Gin 


Ser Ser Thr Ala Lys Ser Ala 










85 










90 95 


Thr 


Trp 


Thr 


Tyr 


Ser 


Thr 


Glu 


Leu 


Lys 


Lys Leu Tyr Cys Gin He Ala 








100 










105 


110 


Lys 


Thr 


Cys 


Pro 


He 


Gin 


He 


Lys 


Val 


Met Thr Pro Pro Pro Gin Gly 




115 










120 




125 


Ala 


Val 


He 


Arg 


Ala 


Met 


Pro 


Val 


Tyr 


Lys Lys Ala Glu His Val Thr 




130 










135 






140 


Glu 


Val 


Val 


Lys 


Arg 


Cys 


Pro 


Asn 


His 


Glu Leu Ser Arg Glu Phe Asn 


145 










150 








155 160 


Glu 


Gly 


Gin 


He 


Ala 


Pro 


Ser 


Ser 


His 


Leu He Arg Val Glu Gly Asn 










165 










170 175 


Ser 


His 


Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 


He Thr Gly Arg Gin Ser Val 








180 










185 


190 


Leu 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val 


Gly Thr Glu Phe Thr Thr Val 
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195 200 205 

Leu Tyr Asn Phe Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg 

210 215 220 

Arg Pro lie Leu lie lie Val Thr Leu Glu Thr Arg Asp Gly Gin Val 
225 230 235 240 

Leu Gly Arg Arg Cys Phe Glu Ala Arg He Cys Ala Cys Pro Gly Arg 

245 250 " 255 

Asp Arg Lys Ala Asp Glu Asp Ser He Arg Lys Gin Gin Val Ser Asp 

260 265 ~ 270 

Ser Thr Lys Asn Gly Asp Gly Thr Lys Arg Pro Phe Arg Gin Asn Thr 

275 280 285 

His Gly He Gin Met Thr Ser He Lys Lys Arg Arg Ser Pro Asp Asp 

290 295 300 

Glu Leu Val Tyr Leu Pro Val Arg Gly Arg Glu Thr Tyr Glu Met Leu 
305 310 315 320 

Val Lys He Lys Glu Ser Leu Glu Leu Met Gin Tyr Leu Leu Gin His 

325 330 335 

Thr He Glu Thr Tyr Arg Gin Gin Gin Gin Gin Gin His Gin His Leu 

340 345 350 

Leu Gin Lys Gin Thr Ser He Gin Ser Pro Ser Ser Tyr Gly Asn Ser 

355 360 365 

Ser Pro Pro Leu Asn Lys Met Asn Ser Met Asn Lys Leu Pro Ser Val 

370 375 380 

Ser Gin Leu lie Asn Pro Gin Gin Arg Asn Ala Leu Thr Pro Thr Thr 
3Q 5 390 395 400 

He Pro Asp Gly Met Gly Ala Asn He Pro Met Met Gly Thr His Met 

405 410 ~ 415 

Pro Met Ala Gly Asp Met Asn Gly Leu Ser Pro Thr Gin Ala Leu Pro 

420 425 430 

Pro Pro Leu Ser Met Pro Ser Thr Ser His Cys Thr Pro Pro Pro Pro 

435 440 ~ 445 

Tyr Pro Thr Asp Cys Ser lie Val Ser Phe Leu Ala Arg Leu Gly Cys 

450 455 460 

Ser Ser Cys Leu Asp Tyr Phe Thr Thr Gin Gly Leu Thr Thr He Tyr 
46 5 470 475 480 

Gin He Glu His Tyr Ser Met Asp Asp Leu Ala Ser Leu Lys He Pro 

485 490 495 

Glu Gin Phe Arg His Ala He Trp Lys Gly lie Leu Asp His Arg Gin 

500 505 * 510 

Leu His Glu Phe Ser Ser Pro Ser His Leu Leu Arg Thr Pro Ser Ser 

515 520 525 

Ala Ser Thr Val Ser Val Gly Ser Ser Glu Thr Arg Gly Glu Arg Val 

530 535 540 

He Asp Ala Val Arg Phe Thr Leu Arg Gin Thr He Ser Phe Pro Pro 
545 550 555 560 

Arg Asp Glu Trp Asn Asp Phe Asn Phe Asp Met Asp Ala Arg Arg Asn 

565 570 575 

Lys Gin Gin Arg He Lys Glu Glu Gly Glu 
580 585 



<210> 153 

<211> 2007 

<212> DNA 

<213> Homo sapiens 

<400> 153 

gaattcgtcg ctgctccagg gaaagttctg ttactccact gactctctct tttcctgata 60 
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acatggccag caagaaagta attacagtgt ttggagcaac aggagctcaa ggtggctctg 120 

tggccagggc aattttggag agcaaaaaat ttgcagtgag agcagtgacc agggatgtga 180 

cttgaccaaa tgccctggag ctccagcgcc ttggagctga ggtggtcaaa ggtgacctga 24 0 
atgataaagc atcggtggac agtgccttaa aaggtgtcta tggggccttc ttggtgacca 300 
acttctggga ccctctcaac caagataagg aagtgtgtcg ggggaagctg gtggcagact 360 

ccgccaagca cctgggtctg aagcacgtgg tgtacagcgg cctggagaac gtcaagcgac 420 

tgacggatgg caagctggag gtgccgcact ttgacagcaa gggcgaggtg gaggagtact 480 

tctggtccat tggcatcccc atgaccagtg tccgcgtggc ggcctacttt gaaaactttc 540 

tcgcggcgtg gcggcccgtg aaagcctctg atggagatta ctacaccttg gctgtaccga 600 

tgggagatgt accaatggat ggtatctctg ttgctgatat tggagcagcc gtctctagca 660 

tttttaattc tccagaggaa tttttaggca aggccgtggg gctcagtgca gaagcactaa 720 

caatacagca atatgctgat gttttgtcca aggctttggg gaaagaagtc cgagatgcaa 780 

agattacccc ggaagctttc gagaagctgg gattccctgc agcaaaggaa atagccaata 840 

tgtgtcgttt ctatgaaatg aagccagacc gagatgtcaa tctcacccac caactaaatc 900 

ccaaagtcaa aagcttcagc cagtttatct cagagaacca gggagccttc aagggcatgt 960 

agaaaatcag ctgttcagat aggcctctgc accacacagc ctctttcctc tctgatcctt 1020 

ttcctcttta cggcacaaca ttcatgttga cagaacatgc tggaatgcaa ttgtttgcaa 1080 

caccgaagga tttcctgcgg tcgcctcttc agtaggaagc actgcattgg tgataggaca 1140 

cggtaatttg attcacattt aacttgctag ttagtgataa gggtggtaca actgtttggt 1200 

aaaatgagaa gcctcggaac ttggagcttc tctcctacca ctaatgggag ggcagattat 1260 

actgggattt ctcctgggtg agtaatttca agccctaatg ctgaaattcc cctaggcagc 1320 

tccagttttc tcaactgcat tgcaaaattc ccagtgaact tttaagtact tttaacttaa 1380 

aaaaatgaac atctttgtag agaattttct ggggaacatg gtgttcaatg aacaagcaca 1440 

agcattggaa atgctaaaat tcagttttgc ctcaagattg gaagtttatt ttctgactca 1500 

ttcatgaagt catctattga gccaccattc aattattcat ctattaattc cttgatcctt 1560 

catttatcca ttctgcaaac ttttcttgag caccagcacg ggtggccatt tgtggacttc 1620 

tcttcattcc tatgtgtttt cttatcaaag tgatccactc tcgaaaggct cctttccagt 1680 

ctgtggttgg gttcaagtca tgccagggcc agggggccca tctcctcgtt tagctctagg 1740 

caaaatccag gggatctgca gtggggagcg ggggcaggaa gctggaggga aggcctgtga 1800 

agggtaggga tgtggaaaga caaggtgaca gaaggaccca ataggacctt tctatatctc 1860 

tggcttagca ttttctacat catattgtaa tcgtcttatt tgctagtttt cttccttact 1920 

gtgagtgact aacagtcatc tttatcccag tgcctggtac ataataagtg atcaataaat 1980 
gttgattgac taaaaaaaaa aaaaaaa 2007 

<210> 154 
<211> 2148 
<212> DNA 

<213> Homo sapiens 
<400> 154 

gaattcgtcg ctgctccagg gaaagttctg ttactccact gactctctct tttcctgata 60 
acatggccag caagaaagta attacagtgt ttggagcaac aggagctcaa ggtggctctg 120 
tggccagggc aattttggag agcaaaaaat ttgcagtgag agcagtgacc agggatgtga 180 
cttgaccaaa tgccctggag ctccagcgcc ttggagctga ggtggtcaaa ggtgacctga 240 
atgataaagc atcggtggac agtgccttaa aaggggaagc tggtggcaga ctccgccaag 300 
cacctgggtc tgaagcacgt ggtgtacagc ggcctggaga acgtcaagcg actgacggat 360 
ggcaagctgg aggtgccgca ctttgacagc aagggcgagg tggaggagta cttctggtcc 420 
attggcatcc ccatgaccag tgtccgcgtg gcggcctact ttgaaaactt tctcgcggcg 480 
tggcggcccg tgaaagcctc tgatggagat tactacacct tggctgtacc gatgggagat 54 0 
gtaccaatgg atggtatctc tgttgctgat attggagcag ccgtctctag catttttaat 600 
tctccagagg aatttttagg caaggccgtg gggctcagtg cagaagcact aacaatacag 660 
caatatgctg atgttttgtc caaggctttg gggaaagaag tccgagatgc aaagactatc 720 
tgtgctatag atgaccagaa aacagtggaa gaaggtttca tggaagacgt gggcttgagt 780 
tggtccttga gggaacatga ccatgtatag acagaggagg catcaagaag gctggcctgg 840 
ctaattctgg aataaacacg acaaaccaga ggcagtacgg gaaggaggca aattctggct 900 
ctgcctctat ccttgattac cccggaagct ttcgagaagc tgggattccc tgcagcaaag 960 
gaaatagcca atatgtgtcg tttctatgaa atgaagccag accgagatgt caatctcacc 1020 
caccaactaa atcccaaagt caaaagcttc agccatttta tctcagagaa ccagggagcc 1080 
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ttcaagggca tgtagaaaat cagctgttca gataggcctc tgcaccacac agcctctttc 1140 
ctctctgatc cttttcctct ttacggcaca acattcatgt tgacagaaca tgctggaatg 1200 
caattgtttg caacaccgaa ggatttcctg cggtcgcctc ttcagtagga agcactgcat 1260 
tggtgatagg acacggtaat ttgattcaca tttaacttgc tagttagtga taagggtggt 1320 
acaactgttt ggtaaaatga gaagcctcgg aacttggagc ttctctccta ccactaatgg 1380 
gagggcagat tatactggga tttctcctgg gtgagtaatt tcaagcccta atgctgaaat 1440 
tcccctaggc agctccagtt ttctcaactg cattgcaaaa ttcccagtga acttttaagt 1500 
acttttaact taaaaaaatg aacatctttg tagagaattt tctggggaac atggtgttca 1560 
atgaacaagc acaagcattg gaaatgctaa aattcagttt tgcctcaaga ttggaagttt 1620 
attttctgac tcattcatga agtcatctat tgagccacca ttcaattatt catctattaa 1680 
ttccttgatc cttcatttat ccattctgca aacttttctt gagcaccagc acgggtggcc 1740 
atttgtggac ttctcttcat tcctatgtgt tttcttatca aagtgatcca ctctcgaaag 1800 
gctcctttcc agtctgtggt tgggttcaag tcatgccagg gccagggggc ccatctcctc 1860 
gtttagctct aggcaaaatc caggggatct gcagtgggga gcgggggcag gaagctggag 1920 
ggaaggcctg tgaagggtag ggatgtggaa agacaaggtg acagaaggac ccaataggac 1980 
ctttctatat ctctggctta gcattttcta catcatattg taatcgtctt atttgctagt 2040 
tttcttcctt actgtgagtg actaacagtc atctttatcc cagtgcctgg tacataataa 2100 
gtgatcaata aatgttgatt gactaaatga aaaaaaaaaa aaaaaaaa 2148 

<210> 155 
<211> 153 
<212> PRT 
<213> Homo sapiens 



<400> 155 



Met Thr Ser Val 


Arg Val 


Ala 


Ala 


Tyr 


Phe 


Glu Asn Phe Leu Ala Ala 


1 


5 










10 


15 


Trp Arg Pro Val 


Lys 


Ala 


Ser 


Asp 


Gly 


Asp 


Tyr Tyr Thr Leu Ala Val 


20 










25 




30 


Pro Met Gly Asp 


Val 


Pro 


Met 


Asp 


Gly 


He 


Ser Val Ala Asp He Gly 


35 








40 






45 


Ala Ala Val Ser 


Ser 


lie 


Phe 


Asn 


Ser 


Pro 


Glu Glu Phe Leu Gly Lys 


50 






55 








60 


Ala Val Gly Leu 


Ser 


Ala 


Glu 


Ala 


Leu 


Thr 


He Gin Gin Tyr Ala Asp 


65 




70 










75 " 80 


Val Leu Ser Lys 


Ala 


Leu 


Gly 


Lys 


Glu 


Val 


Arg Asp Ala Lys lie Thr 




85 










90 


95 


Pro Glu Ala Phe 


Glu 


Lys 


Leu 


Gly 


Phe 


Pro 


Ala Ala Lys Glu He Ala 


100 










105 




110 


Asn Met Cys Arg 


Phe 


Tyr 


Glu 


Met 


Lys 


Pro 


Asp Arg Asp Val Asn Leu 


115 








120 






125 • 


Thr His Gin Leu 


Asn 


Pro 


Lys 


Val 


Lys 


Ser 


Phe Ser Gin Phe He Ser 


130 






135 








140 


Glu Asn Gin Gly 


Ala 


Phe 


Lys 


Gly 


Met 






145 




150 












<210> 156 
















<211> 128 
















<212> PRT 
















<213> Homo sapiens 














<400> 156 
















Met Thr Ser Val Arg Val 


Ala 


Ala 


Tyr 


Phe 


Glu Asn Phe Leu Ala Ala 


1 


5 










10 


15 


Trp Arg Pro Val 


Lys Ala 


Ser 


Asp 


Gly 


Asp 


Tyr Tyr Thr Leu Ala Val 


20 










25 




30 


Pro Met Gly Asp Val 


Pro 


Met 


Asp 


Gly 


He 


Ser Val Ala Asp He Gly 
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35 40 45 

Ala Ala Val Ser Ser lie Phe Asn Ser Pro Glu Glu Phe Leu Gly Lys 

50 55 60 

Ala Val Gly Leu Ser Ala Glu Ala Leu Thr lie Gin Gin Tyr Ala Asp 
65 70 75 ' 80 

Val Leu Ser Lys Ala Leu Gly Lys Glu Val Arg Asp Ala Lys Thr lie 

85 90 " 95 

Cys Ala lie Asp Asp Gin Lys Thr Val Glu Glu Gly Phe Met Glu Asp 

100 105 110 

Val Gly Leu Ser Trp Ser Leu Arg Glu His Asp His Val Ala Gly Ala 

115 120 125 



<210> 157 
<211> 424 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 320, 322 
<223> n = A f T,C or G 

<400> 157 

ctgcagcccg ggggatccac tagtccagtg 
ggatacatta cagcagaeat ggaaatataa 
aattcagtca ccactgttat attaccttct 
tattagattt ccttgtatgc aaagtttttg 
agaggaagga gaaaactgca tcataacttt 
agcccagaaa cttctctgcn gnatctggct 
ccccagccat cgagtcagtt tgtgcccatg 
tgct 

<210> 158 

<211> 2099 

<212> DNA 

<213> Homo sapiens 



tggtggaatt cattggtctt tacaagactt 60 
ttttaaaaaa tttctctcca acctccttca 120 
ccaggaaccc tccagtgggg aaggctgcga 180 
ttgaaagctg tgctcagagg aggtgagagg 24 0 
acagaattga atctagagtc ttccccgaaa 300 
tgtccatctg gtctaaggtg gctgcttctt 360 
aataatacac gacctgctat ttcccatgac 420 

424 



<400> 158 

ccgcggttaa aaggcgcagc aggtgggagc 
ccgacagccg gcggcgcccg agcccgacct 
ccgcgcagag cccgcgccag ggccgccggc 
aaggcacttc ctgtcggtga agaagacctg 
caaacggggc tgacctccct tcctggggag 
agaagatctg gctaaacaat ttctgtatgg 
ttcatgcatc tttaattcaa tttgaatatt 
attgacattc gtatcatcac tgtgcaccat 
aaggaggtct gaaaccctcg cagagggatc 
gcagtcgttg gaaacaggac tcagggataa 
actttcatcg ggggtgtcaa caaacactcc 
atctttattt tccgagtcat gatcctcgtg 
caagaggact tcgtctgcaa cacactgcaa 
tttttcccgg tgtcccacat ccggctgtgg 
gcgctgctgg tggccatgca tgtggcctac 
cgaggagaga agaggaatga tttcaaagac 
atagaggggt cgctgtggtg gacgtacacc 
gcagccttta tgtatgtgtt ttacttcctt 
aaatgtggga ttgacccctg ccccaacctt 



cggggccttc acccgaaacc cgacgagagc 60 • 
gcctgcccag ccggagcgaa gggcgccgcc 120 
cgcagagcag ttaaaacgtg caggcaccag 180 
tctccggtgt cacgggcatc ctgtgttttg 24 0 
caggaagggt cagggaagga aaagaagtac 300 
cgaaagaaaa attctaactt gtacgccctc 360 
ccaggcgaca tcctcactga ccgagcaaag 420 
tggcttctag gcactccagt ggggtaggag 480 
ttgccctcat tctttgggtc tgaaacactg 540 
accagcgcaa tggattgggg gacgctgcac 600 
accagcatcg ggaaggtgtg gatcacagtc 660 
gtggctgccc aggaagtgtg gggtgacgag 720 
ccgggatgca aaaatgtgtg ctatgaccac 780 
gccctccagc tgatcttcgt ctccacccca 840 
tacaggcacg aaaccactcg caagttcagg 900 
atagaggaca ttaaaaagca gaaggttcgg 960 
agcagcatct ttttccgaat catctttgaa 1020 
tacaatgggt accacctgcc ctgggtgttg 1080 
gttgactgct ttatttctag gccaacagag 1140 
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aagaccgtgt ttaccatttt tatgatttct gcgtctgtga tttgcatgct gcttaacgtg 1200 

gcagagttgt gctacctgct gctgaaagtg tgttttagga gatcaaagag agcacagacg 12 60 

caaaaaaatc accccaatca tgccctaaag gagagtaagc agaatgaaat gaatgagctg 1320 

atttcagata gtggtcaaaa tgcaatcaca ggttcccaag ctaaacattt caaggtaaaa 1380 

tgtagctgcg tcataaggag acttctgtct tctccagaag gcaataccaa cctgaaagtt 144 0 

ccttctgtag cctgaagagt ttgtaaatga ctttcataat aaatagacac ttgagttaac 1500 

tttttgtagg atacttgctc cattcataca caacgtaatc aaatatgtgg tccatctctg 1560 

aaaacaagag actgcttgac aaaggagcat tgcagtcact ttgacaggtt ccttttaagt 1620 

ggactctctg acaaagtggg tactttctga aaatttatat aactgttgtt gataaggaac 1680 

atttatccag gaattgatac gtttattagg aaaagatatt tttataggct tggatgtttt 1740 

tagttctgac tttgaattta tataaagtat ttttataatg actggtcttc cttacctgga 1800 

aaaacatgcg atgttagttt tagaattaca ccacaagtat ctaaatttgg aacttacaaa 1860 

gggtctatct tgtaaatatt gttttgcatt gtctgttggc aaatttgtga actgtcatga 1920 

tacgcttaag gtggaaagtg ttcattgcac aatatatttt tactgctttc tgaatgtaga 1980 

cggaacagtg tggaagcaga aggctttttt aactcatccg tttgccaatc attgcaaaca 2040 

actgaaatgt ggatgtgatt gcctcaataa agctcgtccc cattgcttaa aaaaaaaaa 2099 

<210> 159 
<211> 291 
<212> PRT 

<213> Homo sapiens 



<400> 159 





Asp 


Trp 




Thr 


Leu 


EI -5 a 

H.2.S 


inr 


pne 


lie 


biy 




val 


Asn 


T i ret 

Lys 


His 


1 








5 










10 










15 




Ser 


Thr 


Ser 


He 


Gly 


Lys 


Val 


Trp 


He 


Thr 


Val 


He 


Phe 


He 


Phe 


Arg 








20 










25 










30 






Val 


Met 


He 


Leu 


Val 


Val 


Ala 


Ala 


Gin 


Glu 


Val 


Trp 


Gly 


Asp 


,Glu 


Gin 






35 










40 










45 








Glu 


Asp 


Phe 


Val 


Cys 


Asn 


Thr 


Leu 


Gin 


Pro 


Gly 


Cys 


Lys 


Asn 


Val 


Cys 




50 










55 










60 










Tyr 


Asp 


His 


Phe 


Phe 


Pro 


Val 


Ser 


His 


He 


Arg 


Leu 


Trp 


Ala 


Leu 


Gin 


65 










70 










75 










80 


Leu 


He 


Phe 


Val 


Ser 


Thr 


Pro 


Ala 


Leu 


Leu 


Val 


Ala 


Met 


His 


Val 


Ala 










85 










90 










95 




Tyr 


Tyr 


Arg 


His 


Glu 


Thr 


Thr 


Arg 


Lys 


Phe 


Arg 


Arg 


Gly 


Glu 


Lys 


Arg 








100 










105 










110 






Asn 


Asp 


Phe 


Lys 


Asp 


He 


Glu 


Asp 


He 


Lys 


Lys 


Gin 


Lys 


Val 


Arg 


He 






115 










120 










125 








Glu 


Gly 


Ser 


Leu 


Trp 


Trp 


Thr 


Tyr 


Thr 


Ser 


Ser 


He 


Phe 


Phe 


Arg 


He 




130 










135 










140 










lie 


Phe 


Glu 


Ala 


Ala 


Phe 


Met 


Tyr 


Val 


Phe 


Tyr 


Phe 


Leu 


Tyr 


Asn 


Gly 


145 










150 










155 










160 


Tyr 


His 


Leu 


Pro 


Trp 


Val 


Leu 


Lys 


Cys 


Gly 


He 


Asp 


Pro 


Cys 


Pro 


Asn 










165 










170 










175 




Leu 


Val 


Asp 


Cys 


Phe 


He 


Ser 


Arg 


Pro 


Thr 


Glu 


Lys 


Thr 


Val 


Phe 


Thr 








180 










185 










190 






lie 


Phe 


Met 


He 


Ser 


Ala 


Ser 


Val 


He 


Cys 


Met 


Leu 


Leu 


Asn 


Val 


Ala 






195 










200 










205 








Glu 


Leu 


Cys 


Tyr 


Leu 


Leu 


Leu 


Lys 


Val 


Cys 


Phe 


Arg 


Arg 


Ser 


Lys 


Arg 




210 










215 










220 










Ala 


Gin 


Thr 


Gin 


Lys 


Asn 


His 


Pro 


Asn 


His 


Ala 


Leu 


Lys 


Glu 


Ser 


Lys 


225 










230 










235 










240 


Gin 


Asn 


Glu 


Met 


Asn 


Glu 


Leu 


He 


Ser 


Asp 


Ser 


Gly 


Gin 


Asn 


Ala 


He 










245 










250 










255 




Thr 


Gly 


Ser 


Gin 


Ala 


Lys 


His 


Phe 


Lys 


Val 


Lys 


Cys 


Ser 


Cys 


Val 


He 








260 










265 










270 






Arg 


Arg 


Leu 


Leu 


Ser 


Ser 


Pro 


Glu 


Gly 


Asn 


Thr 


Asn 


Leu 


Lys 


Val 


Pro 
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275 280 
Ser Val Ala 
290 



<210> 160 

<211> 3951 

<212> DNA 

<213> Homo sapiens 

<400> 160 

tctgcatcca tattgaaaac ctgacacaat 
gaggcttctc tacaacatga cccaaaggag 
tgtgactctc ctggttgcct taagttcaga 
tcaagacaat gggtataatg gattgctcat 
gaacctcatc tcaaacatta aggaaatgat 
taccaagaga agagtatttt tcagaaatat 
taataataac agcaaaataa aacaagaatc 
ctggtatggg gcacatggag atgatccata 
gggaaaatac attcatttca cacctaattt 
cggatcacga ggccgagtgt ttgtccatga 
tgagtataac aatgacaaac ctttctacat 
gtgttcatct gacatcacag gcatttttgt 
ctgtattatt agtaagcttt ttaaagaagg 
tgcaactgca tcaataatgt tcatgcaaag 
aagtacccac aaccaagaag caccaaacct 
atgggatgta atcacagact ctgctgactt 
gcttccacct cctcccacat tctcgcttgt 
gctggatgtg tccagcaaga tggcagaggc 
agaattttat ttgatgcaga ttgttgaaat 
cagcaaagga gagatcagag cccagctaca 
gctggtttca tatctgccca ccactgtatc 
gcttaagaaa ggatttgagg tggttgaaaa 
gatattagtg accagcggag atgataagct 
cagtggttca acaattcact ccattgccct 
attatcacgt cttacaggag gtttaaagtt 
catgattgat gctttcagta gaatttcctc 
tcagcttgaa agtacaggtg aaaatgtcaa 
tgtggataat actgtgggca acgacactat 
tcctgagatt atattatttg atcctgatgg 
caatctaact tttcggacag ctagtctttg 
gacttacacc ctgaacaata cccatcattc 
tcgcgcctcc aactcagctg tgcccccagc 
cctccatttt cctcatcctg tgatgattta 
tcttaatgcc actgtcactg ccacagttga 
actccttgat gatggagcag gtgctgatgt 
ttttttctcc tttgctgcaa atggtagata 
cagcataagc accccagccc actctattcc 
cacagcaaac ggtaatattc agatgaatgc 
ggagcgaaag tggggcttta gccgagtcag 
tccagctggc ccccaccctg atgtgtttcc 
aaaagtagaa gaggaattga ccctatcttg 
ccaggctaca agctatgaaa taagaatgag 
taacaatgct attttagtaa atacatcaaa 
gatatttacg ttctcacccc aaatttccac 
aacacatgaa agccacagaa tttatgttgc 
gtctgctgta tctaacattg cccaggcgcc 
acctgccaga gattatctta tattgaaagg 



285 



gtatgcagca ggctcagtgt gagtgaactg 60 
cattgcaggt cctatttgca acctgaagtt 120 
actcccattc ctgggagctg gagtacagct 180 
tgcaattaat cctcaggtac ctgagaatca 240 
aactgaagct tcattttacc tatttaatgc 300 
aaagatttta atacctgcca catggaaagc 360 
atatgaaaag gcaaatgtca tagtgactga 42 0 
caccctacaa tacagagggt gtggaaaaga 480 
cctactgaat gataacttaa cagctggcta 540 
atgggcccac ctccgttggg gtgtgttcga 600 
aaatgggcaa aatcaaatta aagtgacaag 660 
gtgtgaaaaa ggtccttgcc cccaagaaaa 720 
atgcaccttt atctacaata gcacccaaaa 780 
tttatcttct gtggttgaat tttgtaatgc 84 0 
acagaaccag atgtgcagcc tcagaagtgc 900 
tcaccacagc tttcccatga acgggactga- 960 
agaggctggt gacaaagtgg tctgtttagt 1020 
tgacagactc cttcaactac aacaagccgc 1080 
tcataccttc gtgggcattg ccagtttcga 1140 
ccaaattaac agcaatgatg atcgaaagtt 1200 
agctaaaaca gacatcagca tttgttcagg 1260 
actgaatgga aaagcttatg gctctgtgat 1320 
tcttggcaat tgcttaccca ctgtgctcag 1380 
gggttcatct gcagccccaa atctggagga 14 40 
ctttgttcca gatatatcaa actccaatag 1500 
tggaactgga gacattttcc agcaacatat 1560 
acctcaccat caattgaaaa acacagtgac 1620 
gtttctagtt acgtggcagg ccagtggtcc 1680 
acgaaaatac tacacaaata attttatcac 17 40 
gattccagga acagctaagc ctgggcactg 1800 
tctgcaagcc ctgaaagtga cagtgacctc 1860 
cactgtggaa gcctttgtgg aaagagacag 1920 
tgccaatgtg aaacagggat tttatcccat 1980 
gccagagact ggagatcctg ttacgctgag 2040 
tataaaaaat gatggaattt actcgaggta 2100 
tagcttgaaa gtgcatgtca atcactctcc 2160 
agggagtcat gctatgtatg taccaggtta 2220 
tccaaggaaa tcagtaggca gaaatgagga 22 80 
ctcaggaggc tccttttcag tgctgggagt 2340 
accatgcaaa attattgacc tggaagctgt 24 00 
gacagcacct ggagaagact ttgatcaggg 24 60 
taaaagtcta cagaatatcc aagatgactt 2520 
gcgaaatcct cagcaagctg gcatcaggga 2580 
gaatggacct gaacatcagc caaatggaga 2640 
aatacgagca atggatagga actccttaca 27 00 
tctgtttatt ccccccaatt ctgatcctgt 27 60 
agttttaaca gcaatgggtt tgataggaat 2820 
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catttgcctt attatagttg tgacacatca tactttaagc aggaaaaaga gagcagacaa 2880 

gaaagagaat ggaacaaaat tattataaat aaatatccaa agtgtcttcc ttcttagata 2940 

taagacccat ggccttcgac tacaaaaaca tactaacaaa gtcaaattaa catcaaaact 3000 

gtattaaaat gcattgagtt tttgtacaat acagataaga tttttacatg gtagatcaac 3060 

aaattctttt tgggggtaga ttagaaaacc cttacacttt ggctatgaac aaataataaa 3120 

aattattctt taaagtaatg tctttaaagg caaagggaag ggtaaagtcg gaccagtgtc 3180 

aaggaaagtt tgttttattg aggtggaaaa atagccccaa gcagagaaaa ggagggtagg 3240 

tctgcattat aactgtctgt gtgaagcaat catttagtta ctttgattaa tttttctttt 3300 

ctccttatct gtgcagaaca ggttgcttgt ttacaactga agatcatgct atatttcata 3360 

tatgaagccc ctaatgcaaa gctctttacc tcttgctatt ttgttatata tattacagat 3420 

gaaatctcac tgctaatgct cagagatctt ttttcactgt aagaggtaac ctttaacaat 3480. 

atgggtatta cctttgtctc ttcataccgg ttttatgaca aaggtctatt gaatttattt 3540 

gtttgtaagt ttctactccc atcaaagcag ctttttaagt tattgccttg gttattatgg 3600 

atgatagtta tagcccttat aatgccttaa ctaaggaaga aaagatgtta ttctgagttt 3660 

gttttaatac atatatgaac atatagtttt attcaattaa accaaagaag aggtcagcag 3720 

ggagatacta acctttggaa atgattagct ggctctgttt tttggttaaa taagagtctt 3780 

taatcctttc tccatcaaga gttacttacc aagggcaggg gaagggggat atagaggtcc 3840 

caaggaaata aaaatcatct ttcatcttta attttactcc ttcctcttat ttttttaaaa 3900 

gattatcgaa caataaaatc atttgccttt ttaattaaaa acataaaaaa a 3951 

<210> 161 
<211> 943 
<212> PRT 

<213> Homo sapiens 



<400> 161 
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<210> 162 
<211> 498 
<212> DNA 
<213> Homo sapiens 

<400> 162 

tggagaacca cgtggacagc accatgaaca tgttgggcgg gggaggcagt 

agcccctcaa gtcgggtatg aaggagctgg ccgtgttccg ggagaaggtc 

accggcagat gggcaagggt ggcaagcatc accttggcct ggaggagccc 

gaccaccccc tgccaggact ccctgccaac aggaactgga ccaggtcctg 

ccaccatgcg ccttccggat gagcggggcc ctctggagca cctctactcc 

ccaactgtga caagcatggc ctgtacaacc tcaaacagtg gcaagatgtc 

cagcgtgggg agtgctggtg tgtgaacccc aacaccggga agctgatcca 

accatccggg gggaccccga gtgtcatctc ttctacaatg agcagcagga 
gtgcacaccc cagcggat 

<210> 163 

<211> 1128 

<212> DNA 

<213> Homo sapiens 

<400> 163 

gccacctggc cctcctgatc gacgacacac gcacttgaaa cttgttctca 
aatcaacttt ccggaagcaa ccagcccacc agaggaggtc ccgagcgcga 
tgcagcggag actggttcag cagtggagcg tcgcggtgtt cctgctgagc 
cctcctgcgg gcgctcggtg gagggtctca gccgccgcct caaaagagct 
atcagctcct ccatgacaag gggaagtcca tccaagattt acggcgacga 
accatctgat cgcagaaatc cacacagctg aaatcagagc tacctcggag 



gctggccgga 60 
actgagcagc 120 
aagaagctgc 180 
gagcggatct 240 
ctgcacatcc 300 
tctgaacggg 360 
gggagccccc 420 
ggctcgcggg 480 
498 



gggtgtgtgg 60 
gcggagacga 120 
tacgcggtgc 180 
gtgtctgaac 240 
ttcttccttc 300 
gtgtccccta 360 
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actccaagcc ctctcccaac acaaagaacc accccgtccg atttgggtct gatgatgagg 420 
gcagatacct aactcaggaa actaacaagg tggagacgta caaagagcag ccgctcaaga 480 
cacctgggaa gaaaaagaaa ggcaagcccg ggaaacgcaa ggagcaggaa aagaaaaaac 540 
ggcgaactcg ctctgcctgg ttagactctg gagtgactgg gagtgggcta gaaggggacc 600 
acctgtctga cacctccaca acgtcgctgg agctcgattc acggaggcat tgaaattttc 660 
agcagagacc ttccaaggac atattgcagg attctgtaat agtgaacata tggaaagtat 720 
tagaaatatt tattgtctgt aaatactgta aatgcattgg aataaaactg tctcccccat 780 
tgctctatga aactgcacat tggtcattgt gaatattttt ttttttgcca aggctaatcc 84 0 
aattattatt atcacattta ccataattta ttttgtccat tgatgtattt attttgtaaa 900 
tgtatcttgg tgctgctgaa tttctatatt ttttgtaaca taatgcactt tagatataca 960 
tatcaagtat gttgataaat gacacaatga agtgtctcta ttttgtggtt gattttaatg 1020 
aatgcctaaa tataattatc caaattgatt ttcctttgtg catgtaaaaa taacagtatt 1080 
ttaaatttgt aaagaatgtc taataaaata taatctaatt acatcatg 1128 

<210> 164 
<211>' 1310 
<212> DNA 

<213> Homo sapiens 
<400> 164 

gggcctggtt cgcaaagaag ctgacttcag agggggaaac tttcttcttt taggaggcgg 60 
ttagccctgt tccacgaacc caggagaact gctggccaga ttaattagac attgctatgg 120 
gagacgtgta aacacactac ttatcattga tgcatatata aaaccatttt attttcgcta 180 
ttatttcaga ggaagcgcct ctgatttgtt tcttttttcc ctttttgctc tttctggctg 240 
tgtggtttgg agaaagcaca gttggagtag ccggttgcta aataagtccc gagcgcgagc 300 
ggagacgatg cagcggagac tggttcagca gtggagcgtc gcggtgttcc tgctgagcta 360 
cgcggtgccc tcctgcgggc gctcggtgga gggtctcagc cgccgcctca aaagagctgt 420 
gtctgaacat cagctcctcc atgacaaggg gaagtccatc caagatttac ggcgacgatt 480 
cttccttcac catctgatcg cagaaatcca cacagctgaa atcagagcta cctcggaggt 54 0 
gtcccctaac tccaagccct ctcccaacac aaagaaccac cccgtccgat ttgggtctga 600 
tgatgagggc agatacctaa ctcaggaaac taacaaggtg gagacgtaca aagagcagcc 660 
gctcaagaca cctgggaaga aaaagaaagg caagcccggg aaacgcaagg agcaggaaaa 720 
gaaaaaacgg cgaactcgct ctgcctggtt agactctgga gtgactggga gtgggctaga 780 
aggggaccac ctgtctgaca cctccacaac gtcgctggag ctcgattcac ggaggcattg 840 
aaattttcag cagagacctt ccaaggacat attgcaggat tctgtaatag tgaacatatg 900 
gaaagtatta gaaatattta ttgtctgtaa atactgtaaa tgcattggaa taaaactgtc 960 
tcccccattg ctctatgaaa ctgcacattg gtcattgtga atattttttt ttttgccaag 1020 
gctaatccaa ttattattat cacatttacc ataatttatt ttgtccattg atgtatttat 1080 
tttgtaaatg tatcttggtg ctgctgaatt tctatatttt ttgtaacata atgcacttta 1140 
gatatacata tcaagtatgt tgataaatga cacaatgaag tgtctctatt ttgtggttga 1200 
ttttaatgaa tgcctaaata taattatcca aattgatttt cctttgtgcc cgtaaaaata 1260 
acagtatttt aaatttgtaa agaatgtcta ataaaatata atctaattac 1310 

<210> 165 
<211> 177 
<212> PRT 

<213> Homo sapiens 
<400> 165 

Met Gin Arg Arg Leu Val Gin Gin Trp Ser Val 

15 10 
Ser Tyr Ala Val Pro Ser Cys Gly Arg Ser Val 

20 25 
Arg Leu Lys Arg Ala Val Ser Glu His Gin Leu 

35 40 
Lys Ser lie Gin Asp Leu Arg Arg Arg Phe Phe 

50 55 
Ala Glu lie His Thr Ala Glu lie Arg Ala Thr 



Ala Val Phe Leu Leu 
15 

Glu Gly Leu Ser Arg 
30 

Leu His Asp Lys Gly 
45 

Leu His His Leu He 
60 

Ser Glu Val Ser Pro 
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65 70 
Asn Ser Lys Pro Ser Pro Asn Thr 
85 

Ser Asp Asp Glu Gly Arg Tyr Leu 
100 

Thr Tyr Lys Glu Gin Pro Leu Lys 
115 120 
Lys Pro Gly Lys Arg Lys Glu Gin 

130 135 
Ser Ala Trp Leu Asp Ser Gly Val 
145 150 
His Leu Ser Asp Thr Ser Thr Thr 
165 

His 



75 80 
Lys Asn His Pro Val Arg Phe Gly 

90 95 
Thr Gin Glu Thr Asn Lys Val Glu 
105 110 
Thr Pro Gly Lys Lys Lys Lys Gly 
125 

Glu Lys Lys Lys Arg Arg Thr Arg 

140 . 

Thr Gly Ser Gly Leu Glu Gly Asp 
155 160 
Ser Leu Glu Leu Asp Ser Arg Arg 
170 175 



<210> 166 
<211> 177 
<212> PRT 
<213> Homo sapiens 



<400> 166 



Met 


Gin 


Arg Arg 


Leu Val Gin Gin 


Trp Ser 


Val Ala Val Phe Leu Leu 


1 






5 


10 


15 


Ser 


Tyr 


Ala Val 


Pro Ser Cys Gly 


Arg Ser 


Val Glu Gly Leu Ser Arg 






20 




25 


30 


Arg 


Leu 


Lys Arg 


Ala Val Ser Glu 


His Gin 


Leu Leu His Asp Lys Gly 






35 


40 




45 


Lys 


Ser 


lie Gin 


Asp Leu Arg Arg 


Arg Phe 


Phe Leu His His Leu He 




50 




55 




60 


Ala 


Glu 


He His 


Thr Ala Glu He 


Arg Ala 


Thr Ser Glu Val Ser Pro 


65 






70 




75 80 


Asn 


Ser 


Lys Pro 


Ser Pro Asn Thr 


Lys Asn 


His Pro Val Arg Phe Gly 








85 


90 


95 


Ser 


Asp 


Asp Glu 


Gly Arg Tyr Leu 


Thr Gin 


Glu Thr Asn Lys Val Glu 






100 




105 


110 


Thr 


Tyr 


Lys Glu 


Gin Pro Leu Lys 


Thr Pro 


Gly Lys Lys Lys Lys Gly 






115 


120 




125 


Lys 


Pro 


Gly Lys 


Arg Lys Glu Gin 


Glu Lys 


Lys Lys Arg Arg Thr Arg 




130 




135' 




140 


Ser 


Ala 


Trp Leu 


Asp Ser Gly Val 


Thr Gly 


Ser Gly Leu Glu Gly Asp 


145 






150 




155 160 


His 


Leu 


Ser Asp 


Thr Ser Thr Thr 


Ser Leu 


Glu Leu Asp Ser Arg Arg 








165 


170 


175 



His 



<210> 167 

<211> 3362 

<212> DNA 

<213> Homo sapiens 



<400> 167 

cacaatgtat gcagcaggct cagtgtgagt gaactggagg cttctctaca acatgaccca 60 
aaggagcatt gcaggtccta tttgcaacct gaagtttgtg actctcctgg ttgccttaag 120 
ttcagaactc ccattcctgg gagctggagt acagcttcaa gacaatgggt ataatggatt 18 0 
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gctcattgca attaatcctc aggtacctga 
aatgataact gaagcttcat tttacctatt 
aaatataaag attttaatac ctgccacatg 
agaatcatat gaaaaggcaa atgtcatagt 
tccatacacc ctacaataca gagggtgtgg 
taatttccta ctgaatgata acttaacagc 
ccatgaatgg gcccacctcc gttggggtgt 
ctacataaat gggcaaaatc aaattaaagt 
ttttgtgtgt gaaaaaggtc cttgccccca 
agaaggatgc acctttatct acaatagcac 
gcaaagttta tcttctgtgg ttgaattttg 
aaacctacag aaccagatgt gcagcctcag 
tgactttcac cacagctttc ccatgaacgg 
gcttgtagag gctggtgaca aagtggtctg 
agaggctgac agactccttc aactacaaca 
tgaaattcat accttcgtgg gcattgccag 
gctacaccaa attaacagca atgatgatcg 
tgtatcagct aaaacagaca tcagcatttg 
tgaaaaactg aatggaaaag cttatggctc 
taagcttctt ggcaattgct tacccactgt 
-tgccctgggt tcatctgcag ccccaaatct 
aaagttcttt gttccagata tatcaaactc 
ttcctctgga actggagaca ttttccagca 
tgtcaaacct caccatcaat tgaaaaacac 
cactatgttt ctagttacgt ggcaggccag 
tgatggacga aaatactaca caaataattt 
tctttggatt ccaggaacag ctaagcctgg 
ccatgcaaaa ttattgacct ggaagctgta 
acagcacctg gagaagactt tgatcagggc 
aaaagtctac agaatatcca agatgacttt 
cgaaatcctc agcaagctgg catcagggag 
aatggacctg aacatcagcc aaatggagaa 
atacgagcaa tggataggaa ctccttacag 
ctgtttattc cccccaattc tgatcctgta 
gttttaacag caatgggttt gataggaatc 
actttaagca ggaaaaagag agcagacaag 
aatatccaaa gtgtcttcct tcttagatat 
actaacaaag tcaaattaac atcaaaactg 
cagataagat ttttacatgg tagatcaaca 
ttacactttg gctatgaaca aataataaaa 
aaagggaagg gtaaagtcgg accagtgtca 
tagccccaag cagagaaaag gagggtaggt 
atttagttac tttgattaat ttttcttttc 
tacaactgaa gatcatgcta tatttcatat 
cttgctattt tgttatatat attacagatg 
tttcactgta agaggtaacc tttaacaata 
tttatgacaa aggtctattg aatttatttg 
tttctaagtt attgccttgg ttattatgga 
taaggaagaa aagatgttat tctgagtttg 
ttcaattaaa ccaaagaaga ggtcagcagg 
gctctgtttt ttggttaaat aagagtcttt 
agggcagggg aagggggata tagaggtcac 
ttttactcct tcctcttatt tttttaaaag 
tt 



gaatcagaac ctcatctcaa acattaagga 240 
taatgctacc aagagaagag tatttttcag 300 
gaaagctaat aataacagca aaataaaaca 360 
gactgactgg tatggggcac atggagatga 420 
aaaagaggga aaatacattc atttcacacc 480 
tggctacgga tcacgaggcc gagtgtttgt 540 
gttcgatgag tataacaatg acaaaccttt 600 
gacaaggtgt tcatctgaca tcacaggcat 660 
agaaaactgt attattagta agctttttaa 720 
ccaaaatgca actgcatcaa taatgttcat 780 
taatgcaagt acccacaacc aagaagcacc 84 0 
aagtgcatgg gatgtaatca cagactctgc 900 
gactgagctt ccacctcctc ccacattctc 960 
tttagtgctg gatgtgtcca gcaagatggc 1020 
agccgcagaa ttttatttga tgcagattgt .1080 
tttcgacagc aaaggagaga tcagagccca 1140 
aaagttgctg gtttcatatc tgcccaccac 1200 
ttcagggctt aagaaaggat ttgaggtggt 1260 
tgtgatgata ttagtgacca gcggagatga 1320 
gctcagcagt ggttcaacaa ttcactccat 1380 
ggaggaatta tcacgtctta caggaggttt 14 4 0 
caatagcatg attgatgctt tcagtagaat 1500 
acatattcag cttgaaagta caggtgaaaa 1560 
agtgactgtg gataatactg tgggcaacga 1620 
tggtcctcct gagattatat tatttgatcc 1680 
tatcaccaat ctaacttttc ggacagctag 1740 
gcactggact tacaccctga tgtgtttcca 1800 
aaagtagaag aggaattgac cctatcttgg 1860 
caggctacaa gctatgaaat aagaatgagt 1920 
aacaatgcta ttttagtaaa tacatcaaag 1980 
atatttacgt tctcacccca aatttccacg 2040 
acacatgaaa gccacagaat ttatgttgca 2100 
tctgctgtat ctaacattgc ccaggcgcct 2160 
cctgccagag attatcttat attgaaagga 2220 
atttgcctta ttatagttgt gacacatcat 2280 
aaagagaatg gaacaaaatt attataaata 2340 
aagacccatg gccttcgact acaaaaacat 2400 
tattaaaatg cattgagttt ttgtacaata 2460 
aattcttttt gggggtagat tagaaaaccc 2520 
attattcttt aaagtaatgt ctttaaaggc 2580 
aggaaagttt gttttattga ggtggaaaaa 2640 
ctgcattata actgtctgtg tgaagcaatc 2700 
tccttatctg tgcagaacag gttgcttgtt 2760 
atgaagcccc taatgcaaag ctctttacct 2820 
aaatctcact gctaatgctc agagatcttt 2880 
tgggtattac ctttgtctct tcataccggt 294 0 
tttgtaagtt tctactccca tcaaagcagc 3000 
tgatagttat agcccttata atgccttaac 3060 
ttttaataca tatatgaaca tatagtttta 3120 
gagatactaa cctttggaaa tgattagctg 3180 
aatcctttct ccatcaagag ttacttacca 3240 
aaggaaataa aaatcatctt tcatctttaa 3300 
attatcgaac aataaaatca tttgcctttt 3360 

3362 



<210> 168 
<211> 2784 
<212> DNA 
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<213> Homo sapiens 
<400> 168 

tctgcatcca tattgaaaac ctgacacaat gtatgcagca ggctcagtgt gagtgaactg 60 
gaggcttctc tacaacatga cccaaaggag cattgcaggt cctatttgca acctgaagtt 120 
tgtgactctc ctggttgcct taagttcaga actcccattc ctgggagctg gagtacagct 180 
tcaagacaat gggtataatg gattgctcat tgcaattaat cctcaggtac ctgagaatca 24 0 
gaacctcatc tcaaacatta aggaaatgat aactgaagct tcattttacc tatttaatgc 300 
taccaagaga agagtatttt tcagaaatat aaagatttta atacctgcca catggaaagc 360 
taataataac agcaaaataa aacaagaatc atatgaaaag gcaaatgtca tagtgactga 420 
ctggtatggg gcacatggag atgatccata caccctacaa tacagagggt gtggaaaaga 480 
gggaaaatac attcatttca cacctaattt cctactgaat gataacttaa cagctggcta 540 
cggatcacga ggccgagtgt ttgtccatga atgggcccac ctccgttggg gtgtgttcga 600 
tgagtataac aatgacaaac ctttctacat aaatgggcaa aatcaaatta aagtgacaag 660 
gtgttcatct gacatcacag gcatttttgt gtgtgaaaaa ggtccttgcc cccaagaaaa 720 
ctgtattatt agtaagcttt ttaaagaagg atgcaccttt atctacaata gcacccaaaa 780 
tgcaactgca tcaataatgt tcatgcaaag tttatcttct gtggttgaat tttgtaatgc 840 
aagtacccac aaccaagaag caccaaacct acagaaccag atgtgcagcc tcagaagtgc 900 
atgggatgta atcacagact ctgctgactt tcaccacagc tttcccatga acgggactga 960 
gcttccacct cctcccacat tctcgcttgt agaggctggt gacaaagtgg tctgtttagt 1020 
gctggatgtg tccagcaaga tggcagaggc tgacagactc cttcaactac aacaagccgc 108 0 
agaattttat ttgatgcaga ttgttgaaat tcataccttc gtgggcattg ccagtttcga 1140 
cagcaaagga gagatcagag cccagctaca ccaaattaac agcaatgatg atcgaaagtt 1200 
gctggtttca tatctgccca ccactgtatc agctaaaaca gacatcagca tttgttcagg 1260 
gcttaagaaa ggatttgagg tggttgaaaa actgaatgga aaagcttatg gctctgtgat 1320 
gatattagtg accagcggag atgataagct tcttggcaat tgcttaccca ctgtgctcag 1380 
cagtggttca acaattcact ccattgccct gggttcatct gcagccccaa atctggagga 1440 
attatcacgt cttacaggag gtttaaagtt ctttgttcca gatatatcaa actccaatag 1500 
catgattgat gctttcagta gaatttcctc tggaactgga gacattttcc agcaacatat 1560 
tcagcttgaa agtacaggtg aaaatgtcaa acctcaccat caattgaaaa acacagtgac 1620 
tgtggataat actgtgggca acgacactat gtttctagtt acgtggcagg ccagtggtcc 1680 
tcctgagatt atattatttg atcctgatgg acgaaaatac tacacaaata attt.tatcac 17 40 
caatctaact tttcggacag ctagtctttg gattccagga acagctaagc ctgggcactg 1800 
gacttacacc ctgaacaata cccatcattc tctgcaagcc ctgaaagtga cagtgacctc 1860 
tcgcgcctcc aactcagctg tgcccccagc cactgtggaa gcctttgtgg aaagagacag 1920 
cctccatttt cctcatcctg tgatgattta tgccaatgtg aaacagggat tttatcccat 1980 
tcttaatgcc actgtcactg ccacagttga gccagagact ggagatcctg ttacgctgag 2040 
actccttgat gatggagcag gtgctgatgt tataaaaaat gatggaattt actcgaggta 2100 
ttttttctcc tttgctgcaa atggtagata tagcttgaaa gtgcatgtca atcactctcc 2160 
cagcataagc accccagccc actctattcc agggagtcat gctatgtatg taccaggtta 2220 
cacagcaaac ggtaatattc agatgaatgc tccaaggaaa tcagtaggca gaaatgagga 2280 
ggagcgaaag tggggcttta gccgagtcag ctcaggaggc tccttttcag tgctgggagt 2340 
tccagctggc ccccaccctg atgtgtttcc accatgcaaa attattgacc tggaagctgt 24 00 
aaatagaaga ggaattgacc ctatcttgga cagcacctgg agaagacttt gatcagggcc 24 60 
aggctacaag ctatgaaata agaatgagta aaagtctaca gaatatccaa gatgacttta 252 0 
acaatgctat tttagtaaat acatcaaagc gaaatcctca gcaagctggc atcagggaga 2580 
tatttacgtt ctcaccccaa atttccacga atggacctga acatcagcca aatggagaaa 2640 
cacatgaaag ccacagaatt tatgttgcaa tacgagcaat ggataggaac tccttacagt 27 00 
ctgctgtatc taacattgcc caggcgcctc tgtttattcc ccccaattct gatcctgtac 27 60 
ctgccagaga ttatcttata ttga " " 2784 

<210> 169 
<211> 592 
<212> PRT 
<213> Homo sapiens 

<400> 169 

Met Thr Gin Arg Ser lie Ala Gly Pro lie Cys Asn Leu Lys Phe Val 
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Thr Leu Leu 

Val Gin Leu 
35 

Pro Gin Val 
50 

He Thr Glu 
65 

Phe Phe Arg 

Asn Asn Ser 

Val Thr Asp 
115 

Tyr Arg Gly 

130 
Phe Leu Leu 
145 

Val Phe Val 

Tyr Asn Asn 

Val Thr Arg 
195 

Gly Pro Cys 

210 
Gly Cys Thr 
225 

Met Phe Met 

Thr His Asn 

Arg Ser Ala 
275 

Phe Pro Met 

290 
Val Glu Ala 
305 

Lys Met Ala 

Phe Tyr Leu 

Ser Phe Asp 
355 

Ser Asn Asp 

370 
Ser Ala Lys 
385 

Glu Val Val 

Leu Val Thr 

Val Leu Ser 
435 

Ala Ala Pro 

450 
Phe Phe Val 



Val Ala 
20 

Gin Asp 

Pro Glu 

Ala Ser 

Asn He 

85 
Lys He 
100 

Trp Tyr 

Cys Gly 

Asn Asp 

His Glu 
165 
Asp Lys 
180 

Cys Ser 

Pro Gin 

Phe He 

Gin Ser 
245 
Gin Glu 
260 

Trp Asp 

Asn Gly 

Gly Asp 

Glu Ala 
325 
Met Gin 
340 

Ser Lys 

Asp Arg 

Thr Asp 

Glu Lys 
405 
Ser Gly 
420 

Ser Gly 
Asn Leu 
Pro Asp 



Leu Ser Ser 

Asn Gly Tyr 
40 

Asn Gin Asn 
55 

Phe Tyr Leu 
70 

Lys He Leu 

Lys Gin Glu 

Gly Ala His 
120 

Lys Glu Gly 

135 
Asn Leu Thr 
150 

Trp Ala His 

Pro Phe Tyr 

Ser Asp He 
200 

Glu Asn Cys 

215 
Tyr Asn Ser 
230 

Leu Ser Ser 

Ala Pro Asn 

Val He Thr 
280 

Thr Glu Leu 

295 
Lys Val Val 
310 

Asp Arg Leu 

He Val Glu 

Gly Glu He 
360 

Lys Leu Leu 

375 
He Ser He 
390 

Leu Asn Gly 

Asp Asp Lys 

Ser Thr He 
440 

Glu Glu Leu 

455 
He Ser Asn 



10 




15 




Glu Leu Pro Phe 


Leu Gly Ala 


Gly 


25 


30 






Asn Gly Leu Leu 


He Ala 


lie 


Asn 




45 






Leu He Ser Asn 


He Lys 


Glu 


Met 


6U 








Phe Asn Ala Thr 


Lys Arg Arg 


Val 


ID 






o n 
o u 


lie Pro Ala Thr 


Trp Lys 


Ala 


Asn 


90 




95 




Ser Tyr Glu Lys 


Ala Asn 


Val 


He 


1 n c 
105 


110 






Gly Asp Asp Pro 


Tyr Thr Leu 


Gin 




125 






Lys Tyr He His 


Phe Thr 


Pro 


Asn 


140 








Ala Gly Tyr Gly 


Ser Arg 


Gly 


Arg 


ICC 






160 


Leu Arg Trp Gly 


Val Phe Asp 


Glu 


170 




175 




He Asn Gly Gin 


Asn Gin 


He 


Lys 


1 oc 

185 


190 






Thr Gly He Phe 


Val Cys 


Glu 


Lys 




205 






He He Ser Lys 


Leu Phe 


Lys 


Glu 


220 








Thr Gin Asn Ala 


Thr Ala 


Ser 


He 


235 






240 


Val Val Glu Pne 


Cys Asn Ala 


Ser 


250 




255 




Leu Gin Asn Gin 


Met Cys 


Ser 


Leu 


zoo 


270 






Asp Ser Ala Asp 


Phe His 


His 


Ser 




285 






Pro Pro Pro Pro 


Thr Phe 


Ser 


Leu 










Cys Leu Val Leu 


Asp Val 


Ser 


Ser 


"3 1 C 






ion 


Leu Gin Leu Gin 


Gin Ala 


Ala 


GiU 


■a o n 




335 




lie His Tnr Pne 


Val Gly 


He 


Ala 


345 


350 






Arg Ala Gin Leu 


His Gin 


He 


Asn 




365 






Val Ser Tyr Leu 


Pro Thr 


Thr 


Val 










Cys Ser Gly Leu 


Lys Lys 


Gly 


Phe 


395 






400 


Lys Ala Tyr Gly 


Ser Val 


Met 


He 


410 




415 




Leu Leu Gly Asn 


Cys Leu 


Pro 


Thr 


425 


430 






His Ser He Ala 


Leu Gly Ser 


Ser 




445 






Ser Arg Leu Thr 


Gly Gly Leu 


Lys 


460 








Ser Asn Ser Met 


He Asp Ala 


Phe 
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465 470 ( 475 480 

Ser Arg lie Ser Ser Gly Thr Gly Asp lie Phe Gin Gin His He Gin 

485 490 495 

Leu Glu Ser Thr Gly Glu Asn Val Lys Pro His His Gin Leu Lys Asn 

500 505 510 

Thr Val Thr Val Asp Asn Thr Val Gly Asn Asp Thr Met Phe Leu Val 

515 520 525 

Thr Trp Gin Ala Ser Gly Pro Pro Glu He He Leu Phe Asp Pro Asp 

530 535 540 

Gly Arg Lys Tyr Tyr Thr Asn Asn Phe He Thr Asn Leu Thr Phe Arg 
545 ' 550 555 560 

Thr Ala Ser Leu Trp He Pro Gly Thr Ala Lys Pro Gly His Trp Thr 

565 570 575 

Tyr Thr Leu Met Cys Phe His His Ala Lys Leu Leu Thr Trp Lys Leu 
580 585 590 



<210> 170 
<211> 791 
<212> PRT 

<213> Homo sapiens 
<400> 170 

Met Thr Gin Arg Ser He Ala Gly Pro He Cys Asn Leu Lys Phe Val 

1 5 10 15 

Thr Leu Leu Val Ala Leu Ser Ser Glu Leu Pro Phe Leu Gly Ala Gly 

20 25 30 

Val Gin Leu Gin Asp Asn Gly Tyr Asn Gly Leu Leu He Ala He Asn 

35 40 45 

Pro Gin Val Pro Glu Asn Gin Asn Leu He Ser Asn He Lys Glu Met 

50 55 60 

He Thr Glu Ala Ser Phe Tyr Leu Phe Asn Ala Thr Lys Arg Arg Val 
65 70 75 80 

Phe Phe Arg Asn He Lys He Leu He Pro Ala Thr Trp Lys Ala Asn 

85 90 95 

Asn Asn Ser Lys He Lys Gin Glu Ser Tyr Glu Lys Ala Asn Val He 

100 105 ~ 110 

Val Thr Asp Trp Tyr Gly Ala His Gly Asp Asp Pro Tyr Thr Leu Gin 

115 120 125 

Tyr Arg Gly Cys Gly Lys Glu Gly Lys Tyr He His Phe Thr Pro Asn 

130 135 140 

Phe Leu Leu Asn Asp Asn Leu Thr Ala Gly Tyr Gly Ser Arg Gly Arg 
145 150 155 " " 160 

Val Phe Val His Glu Trp Ala His Leu Arg Trp Gly Val Phe Asp Glu 

165 170 " 175 

Tyr Asn Asn Asp Lys Pro Phe Tyr lie Asn Gly Gin Asn Gin He Lys 

180 185 190 

Val Thr Arg Cys Ser Ser Asp He Thr Gly He Phe Val Cys Glu Lys 

195 200 205 

Gly Pro Cys Pro Gin Glu Asn Cys lie He Ser Lys Leu Phe Lys Glu 

210 215 220 

Gly Cys Thr Phe He Tyr Asn Ser Thr Gin Asn Ala Thr Ala Ser He 
225 230 235 240 

Met Phe Met Gin Ser Leu Ser Ser Val Val Glu Phe Cys Asn Ala Ser 

245 250 255 

Thr His Asn Gin Glu Ala Pro Asn Leu Gin Asn Gin Met Cys Ser Leu 

260 265 270 

Arg Ser Ala Trp Asp Val He Thr Asp Ser Ala Asp Phe His His Ser 
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Z fD 








280 


Phe 


Pro 


Met 


Asn 


Gly 


Thr 


Glu Leu 




Oflft 

290 










295 


Val 


Glu 


Ala 


Gly 


Asp 


Lys 


Val Val 












J1U 




Lys 


Met 


Ala 


Glu 


AXa 


Asp 


Arg Leu 
















tfne 


Tyr 


Leu 


Met 


p ~\ _ 

Gxn 


T I - 

ixe 


T7 _ T /11„ 

val Glu 








J4U 








Ser 


Phe 


Asp 


Ser 


Lys 


Gly 


Glu lie 






355 








360 


Ser 


Asn 


Asp 


Asp 


Arg 


Lys 


Leu Leu 




J> /U 










J75 


Ser 


TV 1 -» 

Ala 


iiys 


Tnr 


Asp 


xxe 


Ser lie 


jOO 














ulU 


vai 


vax 


G1U 


Lys 


Leu 


Asn Gly 










40o 






T nn 


vai 


Tnr 


Ser 


pi *■ 

GXy 


Asp 


Asp Lys 
















vax 


Leu 


oer 


Ser 


t?xy 


Ser 


Tnr lie 






A 

SOD 








a a n 


711- 


TV 1 — 


Pro 


Asn 


Leu 


bXU 


Glu Leu 














A CC 


Fiie 


Pne 


IT- T 

vax 


Pro 


Asp 


lie 


Ser Asn 


4 DO 










4 /u 




Ser 


TV v- /~r 

Arg 


lie 


Ser 


Ser 


PI,, 

GXy 


Tnr Gly 
















lieu 


CalU 


ser 


inr 


oxy 


(jXU 


Asn val 








con 








inr 


vai 


inr 


vax 


ASp 


7V , 

Asn 


inr vai 






ci t. 

DID 








con 


rpVi -»- 

inr 


Trp 


Gxn 


Ala 


Ser 


Gly 


Pro Pro 




a 












biy 


Arg 


Lys 


Tyr 


Tyr 


Thr 


Asn Asn 












CCA 




inr 


ax a 


oer 


Leu 


Trp 


ixe 


Pro Gly 
















Tyr 


Tnr 


Leu 


Asn 


Asn 


Tnr 


His His 








ton 








vai 


Thr 


Ser 


Arg 


AXa 


Ser 


Asn Ser 






C ft c 








C A A 


Ilia 

A. J. el 


rile 


vax 


l ii 


Arg 


Asp 


Ser Leu 




C*l A 
Dl U 










Dl J 


Tyr 


ax a 


Asn 


vax 


Lys 


bin 


pi,, DVici 

bxy irne 












boU 




rp V-i v- 

Tnr 


Ala 


Tnr 


vax 


PI ii 

GXU 


fro 


pT IpL, „ 

Glu Tnr 










645 






Leu 


Asp 


Asp 


Gly 


Ala 


Gly 


Ala Asp 








660 








Ser 


Arg 


Tyr 


Phe 


Phe 


Ser 


Phe Ala 






675 








680 


Val 


His 


Val 


Asn 


His 


Ser 


Pro Ser 




690 










695 


Pro 


Gly 


Ser 


His 


Ala 


Met 


Tyr Val 


705 










710 




He 


Gin 


Met 


Asn 


Ala 


Pro 


Arg Lys 










725 






Arg 


Lys 


Trp 


Gly 


Phe 


Ser 


Arg Val 



285 



Pro 


Pro 


Pro 


Pro 
300 


Thr 


Phe 


Ser 


Leu 


Cys 


Leu 


Val 
315 


Leu 


Asp 


Val 


Ser 


Ser 
320 


Leu 


Gin 
330 


Leu 


Gin 


Gin 


Ala 


Ala 
335 


Glu 


He 


His 


Thr 


Phe 


Val 


Gly 


He 


Ala 


345 










350 






Arg 


Ala 


Gin 


Leu 


His 
365 


Gin 


He 


Asn 


Val 


Ser 


Tyr 


Leu 
380 


Pro 


Thr 


Thr 


Val 


Cys 


Ser 


Gly 
395 


Leu 


Lys 


Lys 


Gly 


Phe 
400 


Lys 


TV T _ 

AXa 
410 


Tyr 


Gly 


Ser 


vai 


Met 
415 


lie 


Leu 


Leu 


Gly 


Asn 


Cys 


Leu 


Pro 


Thr 


425 










430 






His 


Ser 


He 


Ala 


Leu 
445 


Gly 


Ser 


Ser 


Ser 


Arg 


Leu 


Thr 
460 


Gly 


Gly 


Leu 


Lys 


Ser 


Asn 


Ser 
475 


Met 


He 


Asp 


Ala 


Phe 
480 


Asp 


He 
490 


Phe 


Gin 


Gin 


His 


He 
495 


Gin 


Lys 


Pro 


His 


His 


Gin 


Leu 


Lys 


Asn 


oOd 










CI A 

OlU 






Gly 


Asn 


Asp 


Thr 


Met 
525 


Phe 


Leu 


Val 


Glu 


He 


He 


Leu 
540 


Phe 


Asp 


Pro 


Asp 


Phe 


He 


Thr 
555 


Asn 


Leu 


Thr 


Phe 


Arg 
560 


ml, 

inr 


AXa 
570 


Lys 


Pro 


«xy 


nlS 


Trp 

C 1 c 


inr 


Ser 


Leu 


Gin 


Ala 


Leu 


Lys 


Val 


Thr 


585 










590 






Ala 


Val 


Pro 


Pro 


Ala 
605 


Thr 


Val 


Glu 


His 


Phe 


Pro 


His 
620 


Pro 


Val 


Met 


He 


Tyr 


Pro 


He 
635 


Leu 


Asn 


Ala 


Thr 


Val 
640 


CI \7 

\3JLy 


Asp 
650 


Pro 


Vdl 


XXII. 


lieu 


Arg 
655 


Leu 


Val 


He 


Lys 


Asn 


Asp 


Gly 


He 


Tyr 


665 










670 






Ala 


Asn 


Gly 


Arg 


Tyr 
685 


Ser 


Leu 


Lys 


He 


Ser 


Thr 


Pro 
700 


Ala 


His 


Ser 


He 


Pro 


Gly 


Tyr 
715 


Thr 


Ala 


Asn 


Gly 


Asn 
720 


Ser 


Val 
730 


Gly 


Arg 


Asn 


Glu 


Glu 
735 


Glu 


Ser 


Ser 


Gly 


Gly 


Ser 


Phe 


Ser 


Val 
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740 745 750 

Leu Gly Val Pro Ala Gly Pro His Pro Asp Val Phe Pro Pro Cys Lys 

755 760 765 

He He Asp Leu Glu Ala Val Asn Arg Arg Gly He Asp Pro He Leu 

770 775 780 

Asp Ser Thr Trp Arg Arg Leu 
785 790 



<210> 171 

<211> 1491 

<212> DNA 

<213> Homo sapiens 

<400> 171 

cctcctgcca gccaagtgaa gacatgctta cttccccttc accttccttc atgatgtggg 60 
aagagtgctg caacccagcc ctagccaacg ccgcatgaga gggagtgtgc cgagggcttc 120 
tgagaaggtt tctctcacat ctagaaagaa gcgcttaaga tgtggcagcc cctcttcttc 180 
aagtggctct tgtcctgttg ccctgggagt tctcaaattg ctgcagcagc ctccacccag 240 
cctgaggatg acatcaatac acagaggaag aagagtcagg aaaagatgag agaagttaca 300 
gactctcctg ggcgaccccg agagcttacc attcctcaga cttcttcaca tggtgctaac 360 
agatttgttc ctaaaagtaa agctctagag gccgtcaaat tggcaataga agccgggttc 420 
caccatattg attctgcaca tgtttacaat aatgaggagc aggttggact ggccatccga 480 
agcaagattg cagatggcag tgtgaagaga gaagacatat tctacacttc aaagctttgg 540 
agcaattccc atcgaccaga gttggtccga ccagccttgg aaaggtcact gaaaaatctt 600 
caattggact atgttgacct ctatcttatt cattttccag tgtctgtaaa gccaggtgag 660 
gaagtgatcc caaaagatga aaatggaaaa atactatttg acacagtgga tctctgtgcc 720 
acatgggagg ccatggagaa gtgtaaagat gcaggattgg ccaagtccat cggggtgtcc 780 
aacttcaacc acaggctgct ggagatgatc ctcaacaagc cagggctcaa gtacaagcct 840 
gtctgcaacc aggtggaatg tcatccttac ttcaaccaga gaaaactgct ggatttctgc 900 
aagtcaaaag acattgttct ggttgcctat agtgctctgg gatcccatcg agaagaacca 960 
tgggtggacc cgaactcccc ggtgctcttg gaggacccag tcctttgtgc cttggcaaaa 1020 
aagcacaagc gaaccccagc cctgattgcc ctgcgctacc agctgcagcg tggggttgtg 1080 
gtcctggcca agagctacaa tgagcagcgc atcagacaga acgtgcaggt gtttgaattc 1140 
cagttgactt cagaggagat gaaagccata gatggcctaa acagaaatgt gcgatatttg 1200 
acccttgata tttttgctgg cccccctaat tatccatttt ctgatgaata ttaacatgga 1260 
gggcattgca tgaggtctgc cagaaggccc tgcgtgtgga tggtgacaca gaggatggct 1320 
ctatgctggt gactggacac atcgcctctg gttaaatctc tcctgcttgg cgacttcagt 1380 
aagctacagc taagcccatc ggccggaaaa gaaagacaat aattttgttt ttcattttga 1440 
aaaaattaaa tgctctctcc taaagattct tcacctaaaa aaaaaaaaaa a 1491 

<210> 172 
<211> 364 
<212> PRT 
<213> Homo sapiens 

<400> 172 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly 

1 5 ' 10 15 

Ser Ser Gin He Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp He 

20 25 30 

Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp 

35 40 45 

Ser Pro Gly Arg Pro Arg Glu Leu Thr lie Pro Gin Thr Ser Ser His 

50 55 60 

Gly Ala Asn Arg Phe Val Pro Lys Ser Lys Ala Leu Glu Ala Val Lys 
65 70 75 80 

Leu Ala He Glu Ala Gly Phe His His He Asp Ser Ala His Val Tyr 
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85 


90 


95 




Asn 


Asn 


Glu 


Glu 
100 


Gin 


Val Gly Leu Ala lie Arg Ser Lys lie 
105 110 


Ala 


Asp 


Gly 


Ser 


Val 
115 


Lys 


Arg 


Glu Asp He Phe Tyr Thr Ser Lys Leu 
120 125 


Trp 


Ser 


Asn 


Ser 
130 


His 


Arg 


Pro 


Glu Leu Val Arg Pro Ala Leu Glu Arg 
135 140 


Ser 


Leu 


Lys 


Asn 


Leu 


Gin 


Leu 


Asp Tyr Val Asp Leu Tyr Leu He His 


Phe 


Pro 


140 










150 155 




160 


Val 


Ser 


Val 


Lys 


Pro 
165 


Gly Glu Glu Val He Pro Lys Asp Glu 
170 


Asn 
175 


Gly 


Lys 


lie 


Leu 


Phe Asp 


Thr Val Asp Leu Cys Ala Thr Trp Glu 


Ala 


Met 








180 




185 190 






Glu 


Lys 


Cys 
195 


Lys 


Asp 


Ala Gly Leu Ala Lys Ser He Gly Val 
200 ~ 205 


Ser 


Asn 


Pne 


Asn 


His 


Arg Leu 


Leu Glu Met He Leu Asn Lys Pro Gly 


Leu 


Lys 




210 








215 220 






Tyr 


Lys 


Pro 


Val 


Cys 


Asn Gin Val Glu Cys His Pro Tyr Phe 


Asn 


Gin 


225 










230 235 




240 


Arg 


Lys 


Leu 


Leu Asp 


Phe Cys Lys Ser Lys Asp He Val Leu 


Val 


Ala 










245 


250 


255 




Tyr 


Ser 


Ala 


Leu 
260 


Gly 


Ser His Arg Glu Glu Pro Trp Val Asp 
265 270 


Pro 


Asn 


Ser 


Pro 


Val 
275 


Leu 


Leu 


Glu Asp Pro Val Leu Cys Ala Leu Ala 
280 285 


Lys 


Lys 


His 


Lys 
290 


Arg 


Thr 


Pro 


Ala Leu He Ala Leu Arg Tyr Gin Leu 
295 300 


Gin 


Arg 


Gly 


Val 


Val 


Val 


Leu 


Ala Lys Ser Tyr Asn Glu Gin Arg He 


Arg 


Gin 


305 










310 315 




320 


Asn 


Val 


Gin 


Val 


Phe 
325 


Glu Phe Gin Leu Thr Ser Glu Glu Met 
330 


Lys 
335 


Ala 


lie 


Asp 


Gly 


Leu 
340 


Asn 


Arg Asn Val Arg Tyr Leu Thr Leu Asp 
345 350 


He 


Phe 


Ala 


Gly 


Pro 
355 


Pro 


Asn 


Tyr Pro Phe Ser Asp Glu Tyr 
360 







<210> 173 
<211> 1988 
<212> DNA 

<213> Homo sapiens 
<400> 173 

cgggagccgc ctccccgcgg cctcttcgct 
tctctgctgt cgcccgtccc gcgcgctcct 
ccgcgccgcc cgtcaacatg atccgctgcg 
tgcccctgct cctactcagc gccatcgcct 
ggttgcagtc tagcgaccac ggccagacgt 
gcggcggcag cgggtcctac gaggagggct 
gagcagcggc tgccatgctc ttctgtggct 
ccttcttcgc cctctgtgga ccccagatgc 
ttgccttggc tgctgtgttc cagatcatct 
agaccttcac ccttcatgcc aaccctgctg 
ttgggtgggc agccacgatt atcctgatcg 
actacgaaga tgaccttctg ggcaatgcca 
ttgggaatga atgtgggaga aaatcgctgc 
tttctccagg cgactttgaa cccatttttt 
aatgctaaaa taatttggga gaaaatattt 



tttgtggcgg cgcccgcgct cgcaggccac 60 
ccgacccgct ccgctccgct ccgctcggcc 120 
gcctggcctg cgagcgctgc cgctggatcc 180 
tcgacatcat cgcgctggcc ggccgcggct 240 
cctcgctgtg gtggaaatgc tcccaagagg 300 
gtcagagcct catggagtac gcgtggggta 360 
tcatcatcct ggtgatctgt ttcatcctct 420 
ttgtcttcct gagagtgatt ggaggtctcc 4 80 
ccctggtaat ttaccccgtg aagtacaccc 540 
tcacttacat ctataactgg gcctacggct 600 
gctgtgcctt cttcttctgc tgcctcccca 660 
agcccaggta cttctacaca tctgcctaac 720 
tgctgagatg gactccagaa gaagaaactg 780 
ggcagtgttc atattattaa actagtcaaa 84 0 
tttaagtagt gttatagttt catgtttatc 900 
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ttgtgtcttt tcactaatta cctatactat gccaatattt 960 
tttatactac atttgtaaga gaatatgcac gtgaaactta 1020 
gaggtttcca agatttaata atctgatcaa gttcttgtta 1080 
ggtctgttaa gggctaagga gaagaggaag ataaggttaa 1140 
ttctaaaaga aatgcaaaaa aaaagtttat tttcaagcct 1200 
aaaatcattt cctaaatgca tatcatttgt gagaatttct 1260 
catttcagct aaggcttcat gttgactcga tatgtcatct 1320 
tccaaacctg ttgccatagt tggtaaggct ttcctttaag 1380 
ttttctcttt taaagttctt tatagggtta gggtgtggga 1440 
tgtagtgttt tgtgtttata tgttcagaac cagagtagac 1500 
gtctaattta tcatgactga tagatctggt taagttgtgt 1560 
attcytgtca caaaagtgcc actaaaacag cctcaggaga 1620 
aatctcaggt ttatctgggc tctatcatat agacaggctt 1680 
gcagaaacct acatatagtt aaaatcctgg tctttcttgg 1740 
gatataaaac atgccacagg agaattcggg gatttgagtt 1800 
gatgcatcgg ataggtcatt atgatttttt accatttcga 1860 
tcattttaaa tatcagatta ttattttgta agttgtggaa 1920 
ttatgaagtt ttcccaataa accaggtatt ctaaaaaaaa 1980 

1988 



<400> 174 



Gly 


Ala 


Ala 


Ser 


Pro 


Arg 


Pro 


Leu 


Arg 


Phe 


Cys Gly 


Gly 


Ala Arg 


Ala 


1 








5 










10 








15 




Arg 


Arg 


Pro 


Leu 


Ser 


Ala 


Val 


Ala 


Arg 


Pro 


Ala Arg 


Ser 


Ser Asp 


Pro 








20 










25 










30 




Leu 


Arg 


Ser 


Ala 


Pro 


Leu 


Gly 


Pro 


Ala 


Pro 


Pro 


Val 


Asn 


Met He 


Arg 






35 










40 










45 




Cys 


Gly 
50 


Leu 


Ala 


Cys 


Glu 


Arg 
55 


Cys 


Arg 


Trp 


He 


Leu 
60 


Pro 


Leu Leu 


Leu 


Leu 


Ser 


Ala 


He 


Ala 


Phe 


Asp 


He 


He 


Ala 


Leu 


Ala 


Gly 


Arg Gly 


Trp 


65 










70 










75 




80 


Leu 


Gin 


Ser 


Ser 


Asp 
85 


His 


Gly 


Gin 


Thr 


Ser 
90 


Ser 


Leu 


Trp 


Trp Lys 
95 


Cys 


Ser 


Gin 


Glu Gly 


Gly 


Gly 


Ser 


Gly 


Ser 


Tyr 


Glu 


Glu 


Gly 


Cys Gin 


Ser 








100 










105 










110 




Leu 


Met 


Glu 


Tyr 


Ala 


Trp 


Gly 


Arg 


Ala 


Ala 


Ala 


Ala 


Met 


Leu Phe 


Cys 






115 








120 










125 




Gly 


Phe 
130 


He 


He 


Leu 


Val 


He 
135 


Cys 


Phe 


He 


Leu 


Ser 
140 


Phe 


Phe Ala 


Leu 


Cys 


Gly 


Pro 


Gin 


Met 


Leu^ Val 


Phe 


Leu 


Arg 


Val 


He 


Gly 


Gly Leu 


Leu 


145 










150 










155 




160 


Ala 


Leu 


Ala 


Ala 


Val 
165 


Phe 


Gin 


He. 


He 


Ser 
170 


Leu 


Val 


He 


Tyr Pro 
175 


Val 


Lys 


Tyr 


Thr 


Gin 


Thr 


Phe 


Thr 


Leu 


His 


Ala 


Asn 


Pro 


Ala 


Val Thr 


Tyr 








180 










185 










190 


He 


Tyr 


Asn 


Trp 


Ala 


Tyr Gly 


Phe 


Gly 


Trp 


Ala 


Ala 


Thr 


He He 


Leu 






195 










200 










205 






He 


Gly 


Cys 


Ala 


Phe 


Phe 


Phe 


Cys 


Cys 


Leu 


Pro 


Asn 


Tyr 


Glu Asp 


Asp 




210 










215 










220 




Leu 


Leu 


Gly Asn 


Ala 


Lys 


Pro 


Arg 


Tyr 


Phe 


Tyr 


Thr 


Ser 


Ala 




225 










230 










235 











ttttattatg ttttgtgaag 
ccttatatct atccataaca 
acactttata aggtaaaaat 
tttccaaata gaatggactt 
aagttgttaa tgaccaaaca 
tcgaactatt taaggaaagc 
cattaatatc ctgaatcatt 
aggaaagtac tatttcatgg 
tgtgaaatat ttagatgaaa 
aaatgctata ttaataaatc 
tggattgaaa gatggactgg 
agtaaagcat taggagggtc 
ataaatgact tgcttttcta 
ctgatagttt gcarctgtaa 
taaacagatt ttaaatgtct 
tctctgaata gcatatatat 
cttacataat gaaaaccaat 
aaagctaatt gtagttttca 
aaaaaaaa 

<210> 174 
<211> 238 
<212> PRT 

<213> Homo sapiens 
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<210> 175. 
<211> 4181 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 3347, 3502, 3506, 3520, 3538, 3549, 3646, 3940, 3968, 3974, 
4036, 4056, 4062, 4080, 4088, 4115 
<223> n = A,T,C or G 

<400> 175 

ggtggatgcg tttgggttgt agctaggctt tttcttttct ttctctttta aaacacatct 60 
agacaaggaa aaaacaagcc tcggatctga tttttcactc ctcgttcttg tgcttggttc 120 
ttactgtgtt tgtgtatttt aaaggcgaga agacgagggg aacaaaacca gctggatcca 180 
tccatcaccg tgggtggttt taatttttcg ttttttctcg ttattttttt ttaaacaacc 240 
actcttcaca atgaacaaac tgtatatcgg aaacctcagc gagaacgccg ccccctcgga 300 
cctagaaagt atcttcaagg acgccaagat cccggtgtcg ggacccttcc tggtgaagac 360 
tggctacgcg ttcgtggact gcccggacga gagctgggcc ctcaaggcca tcgaggcgct 420 
ttcaggtaaa atagaactgc acgggaaacc catagaagtt gagcactcgg tcccaaaaag 480 
gcaaaggatt cggaaacttc agatacgaaa tatcccgcct catttacagt gggaggtgct 540 
ggatagttta ctagtccagt atggagtggt ggagagctgt gagcaagtga acactgactc 600 
ggaaactgca gttgtaaatg taacctattc cagtaaggac caagctagac aagcactaga 660 
caaactgaat ggatttcagt tagagaattt caccttgaaa gtagcctata tccctgatga 720 
aatggccgcc cagcaaaacc ccttgcagca gccccgaggt cgccgggggc ttgggcagag 780 
gggctcctca aggcaggggt ctccaggatc cgtatccaag cagaaaccat gtgatttgcc 840 
tctgcgcctg ctggttccca cccaatttgt tggagccatc ataggaaaag aaggtgccac 900 
cattcggaac atcaccaaac agacccagtc taaaatcgat gtccaccgta aagaaaatgc 960 
gggggctgct gagaagtcga ttactatcct ctctactcct gaaggcacct ctgcggcttg 1020 
taagtctatt ctggagatta tgcataagga agctcaagat ataaaattca cagaagagat 1080 
ccccttgaag attttagctc ataataactt tgttggacgt cttattggta aagaaggaag 1140 
aaatcttaaa aaaattgagc aagacacaga cactaaaatc acgatatctc cattgcagga 1200 
attgacgctg tataatccag aacgcactat tacagttaaa ggcaatgttg agacatgtgc 1260 
caaagctgag gaggagatca tgaagaaaat cagggagtct tatgaaaatg atattgcttc 1320 
tatgaatctt caagcacatt taattcctgg attaaatctg aacgccttgg gtctgttccc 1380 
acccacttca gggatgccac ctcccacctc agggccccct tcagccatga ctcctcccta 1440 
cccgcagttt gagcaatcag aaacggagac tgttcatcag tttatcccag ctctatcagt 1500 
cggtgccatc atcggcaagc agggccagca catcaagcag ctttctcgct ttgctggagc 1560 
ttcaattaag attgctccag cggaagcacc agatgctaaa gtgaggatgg tgattatcac 1620 
tggaccacca gaggctcagt tcaaggctca gggaagaatt tatggaaaaa ttaaagaaga 1680 
aaactttgtt agtcctaaag aagaggtgaa acttgaagct catatcagag tgccatcctt 1740 
tgctgctggc agagttattg gaaaaggagg caaaacggtg aatgaacttc agaatttgtc 1800 
aagtgcagaa gttgttgtcc ctcgtgacca gacacctgat gagaatgacc aagtggttgt 1860 
caaaataact ggtcacttct atgcttgcca ggttgcccag agaaaaattc aggaaattct 1920 
gactcaggta aagcagcacc aacaacagaa ggctctgcaa agtggaccac ctcagtcaag 1980 
acggaagtaa aggctcagga aacagcccac cacagaggca gatgccaaac caaagacaga 2040 
ttgcttaacc aacagatggg cgctgacccc ctatccagaa tcacatgcac aagtttttac 2100 
ctagccagtt gtttctgagg accaggcaac ttttgaactc ctgtctctgt gagaatgtat 2160 
actttatgct ctctgaaatg tatgacaccc agctttaaaa caaacaaaca aacaaacaaa 2220 
aaaagggtgg gggagggagg gaaagagaag agctctgcac ttccctttgt tgtagtctca 2280 
cagtataaca gatattctaa ttcttcttaa tattccccca taatgccaga aattggctta 2340 
atgatgcttt cactaaattc atcaaataga ttgctcctaa atccaattgt taaaattgga 2400 
tcagaataat tatcacagga acttaaatgt taagccatta gcatagaaaa actgttctca 2460 
gttttatttt tacctaacac taacatgagt aacctaaggg aagtgctgaa tggtgttggc 2520 
aggggtatta aacgtgcatt tttactcaac tacctcaggt attcagtaat acaatgaaaa 2580 
gcaaaattgt tccttttttt tgaaaatttt atatacttta taatgataga agtccaaccg 2640 
ttttttaaaa aataaattta aaatttaaca gcaatcagct aacaggcaaa ttaagatttt 2700 
tacttctggc tggtgacagt aaagctggaa aattaatttc agggtttttt gaggcttttg 2760 
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acacagttat tagttaaatc aaatgttcaa aaatacggag cagtgcctag tatctggaga 2820 
gcagcactac catttattct ttcatttata gttgggaaag tttttgacgg tactaacaaa 2880 
gtggtcgcag gagattttgg aacggctggt ttaaatggct tcaggagact tcagtttttt 2 940 
gtttagctac atgattgaat gcataataaa tgctttgtgc ttctgactat caatacctaa 3000 
agaaagtgca tcagtgaaga gatgcaagac tttcaactga ctggcaaaaa gcaagcttta 3060 
gcttgtctta taggatgctt agtttgccac tacacttcag accaatggga cagtcataga 3120 
tggtgtgaca gtgtttaaac gcaacaaaag gctacatttc catggggcca gcactgtcat 3180 
gagcctcact aagctatttt gaagattttt aagcactgat aaattaaaaa aaaaaaaaaa 3240 
aaattagact ccaccttaag tagtaaagta taacaggatt tctgtatact gtgcaatcag 3300 
ttctttgaaa aaaaagtcaa aagatagaga atacaagaaa agttttnggg atataatttg 3360 
aatgactgtg aaaacatatg acctttgata acgaactcat ttgctcactc cttgacagca 3420 
aagcccagta cgtacaattg tgttgggtgt gggtggtctc caaggccacg ctgctctctg 3480 
aattgatttt ttgagttttg gnttgnaaga tgatcacagn catgttacac tgatcttnaa 3540 
ggacatatnt tataaccctt taaaaaaaaa atcccctgcc tcattcttat ttcgagatga 3600 
atttcgatac agactagatg tctttctgaa gatcaattag acattntgaa aatgatttaa 3660 
agtgttttcc ttaatgttct ctgaaaacaa gtttcttttg tagttttaac caaaaaagtg 3720 
ccctttttgt cactggtttc tcctagcatt catgattttt ttttcacaca atgaattaaa 3780 
attgctaaaa tcatggactg gctttctggt tggatttcag gtaagatgtg tttaaggcca 3840 
gagcttttct cagtatttga tttttttccc caatatttga ttttttaaaa atatacacat 3900 
aggagctgca tttaaaacct gctggtttaa attctgtcan atttcacttc tagcctttta 3960 
gtatggcnaa tcanaattta cttttactta agcatttgta atttggagta tctggtacta 4020 
gctaagaaat aattcnataa ttgagttttg tactcnccaa anatgggtca ttcctcatgn 4080 
ataatgtncc cccaatgcag cttcattttc caganacctt gacgcaggat aaattttttc 4140 
atcatttagg tccccaaaaa aaaaaaaaaa aaaaaaaaaa a 4181 

<210> 176 
<211> 579 
<212> PRT 

<213> Homo sapiens 



<400> 176 



Met 


Asn 


Lys Leu 


Tyr 


He 


Gly 


Asn 


Leu 


Ser 


Glu Asn Ala Ala Pro Ser 


1 






5 










10 


15 


Asp 


Leu 


Glu Ser 


He 


Phe 


Lys 


Asp 


Ala 


Lys 


He Pro Val Ser Gly Pro 






20 










25 




30 


Phe 


Leu 


Val Lys 


Thr 


Gly 


Tyr 


Ala 


Phe 


Val 


Asp Cys Pro Asp Glu Ser 






35 








40 






45 


Trp 


Ala 


Leu Lys 


Ala 


He 


Glu 


Ala 


Leu 


Ser 


Gly Lys He Glu Leu His 




50 . 








55 








60 


Gly 


Lys 


Pro He 


Glu 


Val 


Glu 


His 


Ser 


Val 


Pro Lys Arg Gin Arg He 


65 








70 










75 80 


Arg 


Lys 


Leu Gin 


He 


Arg 


Asn 


He 


Pro 


Pro 


His Leu Gin Trp Glu Val 








85 










90 


95 


Leu 


Asp 


Ser Leu 


Leu 


Val 


Gin 


Tyr 


Gly 


Val 


Val Glu Ser Cys Glu Gin 






100 










105 




110 


Val 


Asn 


Thr Asp 


Ser 


Glu 


Thr 


Ala 


Val 


Val 


Asn Val Thr Tyr Ser Ser 






115 








120 






125 


Lys 


Asp 


Gin Ala 


Arg 


Gin 


Ala 


Leu 


Asp 


Lys 


Leu Asn Gly Phe Gin Leu 




130 








135 








140 


Glu 


Asn 


Phe Thr 


Leu 


Lys 


Val 


Ala 


Tyr 


He 


Pro Asp Glu Met Ala Ala 


145 








150 










155 160 


Gin 


Gin 


Asn Pro 


Leu 


Gin 


Gin 


Pro 


Arg 


Gly 


Arg Arg Gly Leu Gly Gin 








165 










170 


175 


Arg 


Gly 


Ser Ser 


Arg 


Gin 


Gly 


Ser 


Pro 


Gly 


Ser Val Ser Lys Gin Lys 






180 










185 




190 


Pro 


Cys 


Asp Leu 


Pro 


Leu 


Arg 


Leu 


Leu 


Val 


Pro Thr Gin Phe Val Gly 






195 








200 






205 


Ala 


He 


He Gly 


Lys 


Glu 


Gly 


Ala 


Thr 


He 


Arg Asn He Thr Lys Gin 
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£10 






215 




ml. „ 1 

Tnr Gin 


Ser Lys lie 


Asp Vai 


His 






230 






Glu Lys 


Ser lie Thr 


He 


Leu 


Ser 




O A C 

Z45 








Cys Lys 


Ser He Leu 


Glu 


He 


Met 




O £ft 








Pne Tnr 


Glu Glu He 


Pro 


Leu 


Lys 




0*7 c: 

270 






n 0 ft 
280 


Gly Arg 


Leu He Gly 


Lys Glu 


Gly 


1 ft ft 

290 






295 




Asp Thr 


Asp Thr Lys 


He 


Thr 


He 


"3ft E 

oU5 




310 






Tyr Asn 


Pro Glu Arg 


Thr 


He 


Thr 




J£0 








7i T •» Tun 

AX a Lys 


Ala GlU Glu 


Glu 


He 


Met 












Asn Asp 


lie Mu oer 


Met 


Asn 


Leu 










JDU 




nsn nia jueu 


Gly Leu 


rue 








375 




Pro Thr 


Ser Gly Pro 


Pro 


Ser 


Ala 


JO 0 




390 






GlU bin 


Ser Glu Thr 


Glu 


Thr 


vai 




4U5 








vai Gly 


Ala lie lie 


Gly Lys 


Gin 




42U 








Arg Fne 


Ala Gly Ala 


Ser 


He 


Lys 




/IOC 

4 JO 






441) 


A±a Lys 


vai Arg Met 


Vai 


He 


lie 








455 




Lys Ala 


Gin Gly Arg 


He 


Tyr 


Gly 


ACQ 




470 






Ser Pro 


T ire l"*""! n fin 

i>ys GlU GlU 


Vai 


Lys 


Leu 




485 








Phe Ala 


Ala Gly Arg 


Vai 


He 


Gly 




500 








Leu Gin 


Asn Leu Ser 


Ser 


Ala 


Glu 




515 






520 


Pro Asp 


Glu Asn Asp 


Gin 


Vai 


Vai 


530 






535 




Ala Cys 


Gin Vai Ala 


Gin Arg 


Lys 


545 




550 






Lys Gin 


His Gin Gin 


Gin Lys 


Ala 




565 








Arg Arg 


Lys 









220 



Arg 


Lys 


Glu 
235 


Asn 


Ala 


Gly 


Ala 


Ala 
240 


inr 


rro 

O C ft 

250 


GlU 


Giy 


Thr 


Ser 


Ala 
255 


71 1 ~x 

Ala 


His 


Lys 


Glu 


Ala 


Gin 


Asp 


He 


Lys 


0 c c 
265 










270 






He 


Leu 


Ala 


His 


Asn 
285 


Asn 


Phe 


Val 


Arg 


Asn 


Leu 


Lys 
300 


Lys 


He 


Glu 


Gin 


Ser 


Pro 


Leu 
315 


Gin 


Glu 


Leu 


Thr 


Leu 
320 


val 


Lys 
330 


Gly 


Asn 


Val 


Glu 


Thr 
335 


Cys 


Lys 


Lys 


He 


Arg 


Glu 


Ser 


Tyr 


Glu 


"3 /» C 
J45 










350 






Gin 


Ala 


His 


Leu 


He 
Jb5 


Pro 


Gly 


Leu 


Pro 


Pro 


Thr 


Ser 
0 0 ft 


Gly 


Met 


Pro 


Pro 


Met 


Thr 


Pro 

n n c 

395 


Pro 


Tyr 


Pro 


Gin 


Phe 
400 


riis 


Gin 

A 1 ft 

410 


rne 


lie 


fro 


Ala 


T an 

lieu 

A 1 C 
415 


ber 


Gly 


Gin 


His 


He 


Lys 


Gin 


Leu 


Ser 


A O Q 










/Oft 

4 JO 






He 


Ala 


Pro 


Ala 


Glu 

A A C 

445 


Ala 


Pro 


Asp 


Thr 


Gly 


Pro 


Pro 
460 


Glu 


Ala 


Gin 


Phe 


Lys 


He 


Lys 
475 


Glu 


Glu 


Asn 


Phe 


Val 
480 


Glu 


Ala 
490 


His 


He 


Arg 


Val 


Pro 
4 95 


Ser 


Lys 


Gly 


Gly 


Lys 


Thr 


Val 


Asn 


Glu 


505 










510 






Val 


Val 


Val 


Pro 


Arg 
525 


Asp 


Gin 


Thr 


Val 


Lys 


He 


Thr 
540 


Gly 


His 


Phe 


Tyr 


He 


Gin 


Glu 
555 


He 


Leu 


Thr 


Gin 


Val 
560 


Leu 


Gin 
570 


Ser 


Gly 


Pro 


Pro 


Gin 
575 


Ser 



<210> 177 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<400> 177 

atgccccgta aatgtcttca gtgttcttca 
agatccaaac aaatacacat tctgtgtttt 
cacacagcaa aaaattgttt actttgttgg 



gggtagttgg gatctcaaaa gatttggttc 60 
agctcagtgt tttctaaaaa aagaaactgc 120 
acaaaccaaa tcagttctca aaaaatgacc 180 
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ggtgcttata aaaagttata aatatcgagt 
gaagtgagct tgtgcttagt atttacattg 
gcaaactggt gcagaaattc tataaactct 
attttgtttt gttttgtaaa aatgataaaa 

<210> 178 
<211> 561 
<212> DNA 
<213> Homo sapiens 



agctctaaaa caaaccacct gaccaagagg 240 
gatgccagtt ttgtaatcac tgacttatgt 300 
ttgctgtttt tgatacctgc tttttgtttc 360 
cttcagaaaa t 401 



<400> 178 

acgcctttca agggtgtacg caaagcactc attgataccc ttttggatgg ctatgaaaca 60 

gcccgctatg ggacaggggt ctttggccag aatgagtacc tacgctatca ggaggccctg 120 

agtgagctgg ccactgcggt taaagcacga attgggagct ctcagcgaca tcaccagtca 180 

gcagccaaag acctaactca gtcccctgag gtctccccaa caaccatcca ggtgacatac 24 0 

ctcccctcca gtcagaagag taaacgtgcc aagcacttcc ttg;aattgaa gagctttaag 300 

gataactata acacattgga gagtactctg tgacggagct gaaggactct tgccgtagat 360 

taagccagtc agttgcaatg tgcaagacag gctgcttgcc gggccgccct cggaacatct 420 

ggcccagcag gcccagactg tatccatcca agttcccgtt gtatccagag ttcttagagc 480 

ttgtgtctaa agggtaattc cccaaccctt ccttatgagc atttttagaa cattggctaa 540 

gactattttc ccccagtagc g 561 

<210> 179 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<400> 179 

cccaacgcgt ttgcaaatat tcccctggta gcctacttcc ttacccccga atattggtaa 60 

gatcgagcaa tggcttcagg acatgggttc tcttctcctg tgatcattca agtgctcact 120 

gcatgaagac tggcttgtct cagtgtttca acctcaccag ggctgtctct tggtccacac 180 

ctcgctccct gttagtgccg tatgacagcc cccatcaaat gaccttggcc aagtcacggt 24 0 

ttctctgtgg tcaaggttgg ttggctgatt ggtggaaagt agggtggacc aaaggaggcc 300 

acgtgagcag tcagcaccag ttctgcacca gcagcgcctc cgtcctagtg ggtgttcctg 360 

tttctcctgg ccctgggtgg gctagggcct gattcgggaa gatgcctttg cagggagggg 420 

aggataagtg ggatctacca attgattctg gcaaaacaat ttctaagatt tttttgcttt 480 

atgtgggaaa cagatctaaa tctcatttta tgctgtattt t 521 

<210> 180 
<211> 417 
<212> DNA 
<213> Homo sapiens 



<400> 180 

ggtggaattc gccgaagatg gcggaggtgc 
tcctgggccg cctggcggcc atcgtggcta 
tcgtacgctg tgaaggcatc aacatttctg 
tggctttcct ccgcaagcgg atgaacacca 
cccccagccg catcttctgg cggaccgtgc 
gccaggccgc tctggaccgt ctcaaggtgt 
aaaagcggat ggtggttcct gctgccctca 

<210> 181 
<211> 283 
<212> DNA 
<213> Homo sapiens 



aggtcctggt gcttgatggt cgaggccatc 60 
aacaggtact gctgggccgg aaggtggtgg 12 0 
gcaatttcta cagaaacaag ttgaagtacc 180 
acccttcccg aggcccctac cacttccggg 240 
gaggtatgct gccccacaaa accaagcgag 300 
ttgacggcat cccaccgccc tacgacaaga 360 
aggtcgtgcg tctgaagcct acaagaa 417 



<220> 
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<221> misc_feature 
<222> 35 

<223> n - A,T,C or G 
<400> 181 

gatttcttct aaataggatg taaaacttct ttcanattac tcttcctcag tcctgcctgc 60 
caagaactca agtgtaactg tgataaaata acctttccca ggtatattgg caggtatgtg 120 
tgtaatctca gaatacacag gtgacataga tatgatatga caactggtaa tggtggattc 180 
atttacattg tttacacttc tatgaccagg ccttaaggga aggtcagttt tttaaaaaac 240 
caagtagtgt cttcctacct atctccagat acatgtcaaa aaa 283 

<210> 182 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> 182 

atattcttgc tgcttatgca gctgacattg ttgccctccc taaagcaacc aagtagcctt 60 
tatttcccac agtgaaagaa aacgctggcc tatcagttac attacaaaag gcagatttca 120 
agaggattga gtaagtagtt ggatggcttt cataaaaaca agaattcaag aagaggattc 180 
atgctttaag aaacatttgt tatacattcc tcacaaatta tacctgggat aaaaactatg 240 
tagcaggcag tgtgttttcc ttccatgtct ctctgcacta cctgcagtgt gtcctctgag 300 
gctgcaagtc tgtcctatct gaattcccag cagaagcact aagaagctcc accctatcac 360 
ctagcagata aaactatggg gaaaacttaa atctgtgcat a 401 

<210> 183 
<211> 366 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 325 

<223> n = A,T,C or G 



<400> 183 

accgtgtcca agtttttaga acccttgtta gccagaccga ggtgtcctgg tcaccgtttc 60 
accatcatgc tttgatgttc ccctgtcttt ctctcttctg ctctcaagag caaaggttaa 120 
tttaaggaca aagatgaagt cactgtaaac taatctgtca ttgtttttac cttccttttc 180 
tttttcagtg cagaaattaa aagtaagtat aaagcaccgt gattgggagt gtttttgcgt 240 
gtgtcggaat cactggtaaa tgttggctga gaacaatccc tccccttgca cttgtgaaaa 300 
cactttgagc gctttaagag attancctga gaaataatta aatatctttt ctcttcaaaa 360 
aaaaaa 366 

<210> 184 
<211> 370 
<212> DNA 
<213> Homo sapiens 

<400> 184 

tcttacttca aaagaaaaat aaacataaaa aataagttgc tggttcctaa caggaaaaat 60 
tttaataatt gtactgagag aaactgctta cgtacacatt gcagatcaaa tatttggagt 120 
taaaatgtta gtctacatag atgggtgatt gtaactttat tgccattaaa agatttcaaa 180 
ttgcattcat gcttctgtgt acacataatg aaaaatgggc aaataatgaa gatctctcct 24 0 
tcagtctgct ctgtttaatt ctgctgtctg ctcttctcta atgctgcgtc cctaattgta 300 
cacagtttag tgatatctag gagtataaag ttgtcgccca tcaataaaaa tcacaaagtt 360 
ggtttaaaaa 370 



WO 02/47534 



95 



PCT/US01/47576 



<210> 185 
<211> 107 
<212> DNA 
<213> Homo sapiens 



<400> 185 

ctcatattat tttccttttg agaaattgga aactctttct gttgctatta tattaataaa 60 
gttggtgttt attttctggt agtcaccttc cccatttaaa aaaaaaa 107 

<210> 186 
<211> 309 
<212> DNA 
<213> Homo sapiens 



<400> 186 

gaaaggatgg ctctggttgc cacagagctg 
agagggccac aggggtggcc gggagttgtc 
gccagtgagt gacagtcatg agggagtgtc 
ttctgtctga atgaaaggcc aaggctacag 
tgcccacgta gtggaggcct ctggcagatc 
tttatggtt 

<210> 187 
<211> 477 
<212> DNA 
<213> Homo sapiens 



ggacttcatg ttcttctaga gagggccaca 60 
agctgatgcc tgctgagagg caggaattgt 120 
tcttcttggg gaggaaagaa ggtagagcct 18 0 
tacagggccc cgccccagcc agggtgttaa 240 
ctgcattcca aggtcactgg actgtacgtt 300 

309 



<400> 187 

ttcagtccta gcaagaagcg agaattctga gatcctccag aaagtcgagc agcacccacc 60 

tccaacctcg ggccagtgtc ttcaggcttt actggggacc tgcgagctgg cctaatgtgg 120 

tggcctgcaa gccaggccat ccctgggcgc cacagacgag ctccgagcca ggtcaggctt 180 

cggaggccac aagctcagcc tcaggcccag gcactgattg tggcagaggg gccactaccc 24 0 

aaggtctagc taggcccaag acctagttac ccagacagtg agaagcccct ggaaggcaga 300 

aaagttggga gcatggcaga cagggaaggg aaacattttc agggaaaaga catgtatcac 360 

atgtcttcag aagcaagtca ggtttcatgt aaccgagtgt cctcttgcgt gtccaaaagt 420 

agcccagggc tgtagcacag gcttcacagt gattttgtgt tcagccgtga gtcacac 477 

<210> 188 

<211> 220 

<212> DNA 

<213> Homo sapiens 

<400> 188 

taaatatggt agatattaat attcctctta gatgaccagt gattccaatt gtcccaagtt 60 

ttaaataagt accctgtgag tatgagataa attagtgaca atcagaacaa gtttcagtat 12 0 

cagatgttca agaggaagtt gctattgcat tgattttaat atttgtacat aaacactgat 180 

ttttttgagc attattttgt atttgttgta ctttaatacc 220 

<210> 189 
<211> 417 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 76, 77 

<223> n = A,T,C or G 
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<400> 189 

accatcttga cagaggatac atgctcccaa 
ccatcattaa gcatcnnttt caaaattata 
tatcattatt ctagtccttt gaatttgtaa 
atgcactttt ctccagcaca tcagatttca 
gcacttgcta gtactacaca ctttgtacaa 
agaaaagcct tcctttgttg gcccttaaac 
tctgacgata cctgtatgtt cttattgtgt 

<210> 190 
<211> 497 
<212> DNA 
<213> Homo sapiens 



aacgtttgtt accacactta aaaatcactg 60 
gccattcatg atttactttt tccagatgac 120 
ggggaaaaaa aacaaaaaca aaaacttacg 180 
aattgaaaat taaagacatg ctatggtaat 240 
caaaaaacag aggcaagaaa caacggaaag 300 
tgagtcaaga tctgaaatgt agagatgatc 360 
aaataaaatt gctggtatga aatgaca 417 



<400> 190 

gcactgcggc gctctcccgt cccgcggtgg ttgctgctgc tgccgctgct gctgggcctg 60 

aacgcaggag ctgtcattga ctggcccaca gaggagggca aggaagtatg ggattatgtg 120 

acggtccgca aggatgccta catgttctgg tggctctatt atgccaccaa ctcctgcaag 180 

aacttctcag aactgcccct ggtcatgtgg cttcagggcg gtccaggcgg ttctagcact 240 

ggatttggaa actttgagga aattgggccc cttgacagtg atctcaaacc acggaaaacc 300 

acctggctcc aggctgccag tctcctattt gtggataatc ccgtgggcac tgggttcagt 360 

tatgtgaatg gtagtggtgc ctatgccaag gacctggcta tggtggcttc agacatgatg 420 

gttctcctga agaccttctt cagttgccac aaagaattcc agacagttcc attctacatt 480 
ttctcagagt cctatgg 497 

<210> 191 
<211> 175 
<212> DNA 
<213> Homo sapiens 

<400> 191 

atgttgaata ttttgcttat taactttgtt tattgtcttc tccctcgatt agaatattag 60 

ctacttgagt acaaggattt gagcctgtta cattcactgc tgaattttag gctcctggaa 120 

gatacccagc attcaataga gaccacacaa taaatatatg tcaaataaaa aaaaa 175 

<210> 192 
<211> 526 
<212> DNA 

<213> Homo sapiens 
<400> 192 

agtaaacatt attatttttt ttatatttgc aaaggaaaca tatctaatcc ttcctataga 60 
aagaacagta ttgctgtaat tccttttctt ttcttcctca tttcctctgc cccttaaaag 120 
attgaagaaa gagaaacttg tcaactcata tccacgttat ctagcaaagt acataagaat 180 
ctatcactaa gtaatgtatc cttcagaatg tgttggttta ccagtgacac cccatattca 24 0 
tcacaaaatt aaagcaagaa gtccatagta atttatttgc taatagtgga tttttaatgc 300 
tcagagtttc tgaggtcaaa ttttatcttt tcacttacaa gctctatgat cttaaataat 360 
ttacttaatg tattttggtg tattttcctc aaattaatat tggtgttcaa gactatatct 420 
aattcctctg atcactttga gaaacaaact tttattaaat gtaaggcact tttctatgaa 480 
ttttaaatat aaaaataaat attgttctga ttattactga aaaaaa 52 6 

<210> 193 
<211> 553 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc feature 
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<222> 290, 300, 411, 441 
<223> n = A,T,C or G 

<400> 193 

tccattgtgg tggaattcgc tctctggtaa aggcgtgcag gtgttggccg cggcctctga 60 

gctgggatga gccgtgctcc cggtggaagc aagggagccc agccggagcc atggccagta 120 

cagtggtagc agttggactg accattgctg ctgcaggatt tgcaggccgt tacgttttgc 180 

aagccatgaa gcatatggag cctcaagtaa aacaagtttt tcaaagccta ccaaaatctg 240 

ccttcagtgg tggctattat agaggtgggt ttgaacccaa aatgacaaan cgggaagcan 300 

cattaatact aggtgtaagc cctactgcca ataaagggaa aataagagat gctcatcgac 360 

gaattatgct tttaaatcat cctgacaaag gaggatctcc ttatatagca nccaaaatca 420 

atgaagctaa agatttacta naaggtcaag ctaaaaaatg aagtaaatgt atgatgaatt 480 

ttaagttcgt attagtttat gtatatgagt actaagtttt tataataaaa tgcctcagag 54 0 

ctacaatttt aaa * 553 

<210> 194 
<211> 320 
<212> DNA 
<213> Homo sapiens 

<400> 194 

cccttcccaa tccatcagta aagaccccat ctgccttgtc catgccgttt cccaacaggg 60 
atgtcacttg atatgagaat ctcaaatctc aatgccttat aagcattcct tcctgtgtcc 120 
attaagactc tgataattgt ctcccctcca taggaatttc tcccaggaaa gaaatatatc 180 
cccatctccg tttcatatca gaactaccgt ccccgatatt cccttcagag agattaaaga 240 
ccagaaaaaa gtgagcctct tcatctgcac ctgtaatagt ttcagttcct attttcttcc 300 
attgacccat atttatacct ~ 320 

<210> 195 

<211> 320 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 203, 218 
<223> n - A,T,C or G 

<400> 195 

aagcatgacc tggggaaatg gtcagacctt gtattgtgtt tttggccttg aaagtagcaa 60 

gtgaccagaa tctgccatgg caacaggctt taaaaaagac ccttaaaaag acactgtctc 120 

aactgtggtg ttagcaccag ccagctctct gtacatttgc tagcttgtag ttttctaaga 180 

ctgagtaaac ttcttatttt tanaaagggg aggctggntt gtaactttcc ttgtacttaa 240 

ttgggtaaaa gtcttttcca caaaccacca tctattttgt gaactttgtt agtcatcttt 300 

tatttggtaa attatgaact ' 320 

<210> 196 
<211> 357 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 36 

<223> n = A,T,C or G 



<400> 196 

atataaaata atacgaaact ttaaaaagca ttggantgtc agtatgttga atcagtagtt 



60 
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tcactttaac tgtaaacaat ttcttaggac accatttggg ctagtttctg tgtaagtgta 120 

aatactacaa aaacttattt atactgttct tatgtcattt gttatattca tagatttata 180 

tgatgatatg acatctggct aaaaagaaat tattgcaaaa ctaaccacta tgtacttttt 240 

tataaatact gtatggacaa aaaatggcat tttttatatt aaattgttta gctctggcaa 300 

aaaaaaaaaa ttttaagagc tggtactaat aaaggattat tatgactgtt aaaaaaa 357 

<210> 197 
<211> 565 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 27 

<223> n = A,T,C or G 
<400> 197 

tcagctgagt accatcagga tatttanccc tttaagtgct gttttgggag tagaaaacta 60 

aagcaacaat acttcctctt gacagctttg attggaatgg ggttattaga tcattcacct 120 

tggtcctaca ctttttagga tgcttggtga acataacacc acttataatg aacatccctg 180 

gttcctatat tttgggctat gtgggtagga attgttactt gttactgcag cagcagccct 240 

agaaagtaag cccagggctt cagatctaag ttagtccaaa agctaaatga tttaaagtca 300 

agttgtaatg ctaggcataa gcactctata atacattaaa ttataggccg agcaattagg 360 

gaatgtttct gaaacattaa acttgtattt atgtcactaa aattctaaca caaacttaaa 420 

aaatgtgtct catacatatg ctgtactagg cttcatcatg catttctaaa tttgtgtatg 480 

atttgaatat atgaaagaat ttatacaaga gtgttattta aaattattaa aaataaatgt 54 0 

atataatttg tacctattgt aaaaa 565 



<210> 198 
<211> 484 
<212> DNA 
<213> Homo sapiens 



<400> 198 

tatgtaagta ttggtgtctg ctttaaaaaa 
acatttgaga acagtgttac tctgagcagt 
ctgttggatg tgtccattgt cgccagtttg 
tgggcgcagc agcaggtggc aggggtgtgg 
tctctggtgc tttctgagag ggtctctaaa 
agcacgtatt tctcccctct agtacctctg 
agggcagcag actcttgagt atactgcaga 
tccaggggct caactgacca agtaacacag 
aaac 



ggagacccag acttcacctg tcctttttaa 60 
tgggccacct tcaccttatc cgacagctga 120 
gctgttgccc ggacaggaca ggacctccat 180 
cttgaggtgg gtggcagcgt ctggtcctcc 240 
gcagagtgtg gttggcctgg gggaaggcag 300 
catttgtgag tgttccctct ggctttctga 360 
ggacatgctt tatcagtagg tcctgagggc 420 
aagttggggt atgtggccta tttgggtcgg 4 80 

484 



<210> 199 
<211> 429 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 77, 88, 134, 151, 189, 227, 274, 319 
<223> n = A, T, C or G 



• <400> 199 

gcttatgttt tttgttttaa cttttgtttt 

tacagtacct ttctcanaca ttttgtanaa 

gaacattaaa aagngtgata gcgatattag 



ttaacattta gaatattaca ttttgtatta 60 
ttcatttcgg cagctcacta ggattttgct 120 
ngccaatcaa atggaaaaaa ggtagtctta 180 
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ataaacaana cacaacgttt ttatacaaca tactttaaaa tattaanaaa actccttaat 24 0 

attgtttcct attaagtatt attctttggg caanattttc tgatgctttt gattttctct 300 

caatttagca tttgctttng gtttttttct ctatttagca ttctgttaag gcacaaaaac 360 

tatgtactgt atgggaaatg ttgtaaatat taccttttcc acattttaaa cagacaactt 420 

tgaatccaa 429 

<210> 200 
<211> 279 
<212> DNA 

<213> Homo sapiens 
<400> 200 

gcttttttga ggaattacag ggaagctcct ggaattgtac atggatatct ttatccctag 60 
ggggaaatca aggagctggg cacccctaat tctttatgga agtgtttaaa actattttaa 120 
ttttattaca agtattacta gagtagtggt tctactctaa gatttcaaaa gtgcatttaa 180 
aatcatacat gttcccgcct gcaaatatat tgttattttg gtggagaaaa aaatagtata 240 
ttctacataa aaaattaaag atattaacta agaaaaaaa 279 

<210> 201 
<211> 569 
<212> DNA 

<213> Homo sapiens 
<400> 201 

taggtcagta tttttagaaa ctcttaatag ctcatactct tgataccaaa agcagccctg 60 
attgttaaag cacacacctg cacaagaagc agtgatggtt gcatttacat ttcctgggtg 120 
cacaaaaaaa aattctcaaa aagcaaggac ttacgctttt tgcaaagcct ttgagaagtt 180 
actggatcat aggaagctta taacaagaat ggaagattct taaataactc actttctttg 24 0 
gtatccagta acagtagatg ttcaaaatat gtagctgatt aataccagca ttgtgaacgc 300 
tgtacaacct tgtggttatt actaagcaag ttactactag cttctgaaaa gtagcttcat 360 
aattaatgtt atttatacac tgccttccat gacttttact ttgccctaag ctaatctcca 420 
aaatctgaaa tgctactcca atatcagaaa aaaaggggga ggtggaatta tatttcctgt 480 
gattttaaga gtacagagaa tcatgcacat ctctgattag ttcatatatg tctagtgtgt 540 
aataaaagtc aaagatgaac tctcaaaaa 569 

<210> 202 
<211> 501 
<212> DNA 

<213> Homo sapiens 
<400> 202 

attaataggc ttaataattg ttggcaagga tccttttgct ttctttggca tgcaagctcc 60 
tagcatctgg cagtggggcc aagaaaataa ggtttatgca tgtatgatgg ttttcttctt 120 
gagcaacatg attgagaacc agtgtatgtc aacaggtgca tttgagataa ctttaaatga 180 
tgtacctgtg tggtctaagc tggaatctgg tcaccttcca tccatgcaac aacttgttca 240 
aattcttgac aatgaaatga agctcaatgt gcatatggat tcaatcccac accatcgatc 300 
atagcaccac ctatcagcac tgaaaactct tttgcattaa gggatcattg caagagcagc 360 
gtgactgaca ttatgaaggc ctgtactgaa gacagcaagc tgiitagtaca gaccagatgc 420 
tttcttggca ggctcgttgt acctcttgga aaacctcaat gcaagatagt gtttcagtgc 480 
tggcatattt tggaattctg c 501 

<210> 203 
<211> 261 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc feature 
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<222>36, 96 

<223> n = A, T, C or G 

<400> 203 

gacaagctcc tggtcttgag atgtcttctc 
gataaaatga atgagttctg tcatgattca 
gttagctctt tgaatgttct tgaaatttta 
tatcattgta taaaagctgt tatgtgcaac 
aatacttaaa cactgaaaaa a 

<210> 204 
<211> 421 
<212> DNA 
<213> Homo sapiens 



gttaangaga tgggcctttt ggaggtaaag 60 
ctattntata acttgcatga cctttactgt 120 
gactttcttt gtaaacaaat gatatgtcct 180 
agtgtggaga ttccttgtct gatttaataa 240 

261 



<400> 204 

agcatctttt ctacaacgtt aaaattgcag aagtagctta tcattaaaaa acaacaacaa 60 

caacaataac aataaatcct aagtgtaaat cagttattct accccctacc aaggatatca 120 

gcctgttttt tccctttttt ctcctgggaa taattgtggg cttcttccca aatttctaca 180 

gcctctttcc tcttctcatg cttgagcttc cctgtttgca cgcatgcgtg tgcaggactg 240 

gcttgtgtgc ttggactcgg ctccaggtgg aagcatgctt tcccttgtta ctgttggaga 300 

aactcaaacc ttcaagccct aggtgtagcc attttgtcaa gtcatcaact gtatttttgt 360 

actggcatta acaaaaaaag aagataaaat attgtaccat taaactttaa taaaacttta 420 

a 421 

<210> 205 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<400> 205 

tactctcaca atgaaggacc tggaatgaaa aatctgtgtc taaacaagtc ctctttagat 60 

tttagtgcaa atccagagcc agcgtcggtt gcctcgagta attctttcat gggtaccttt 120 

ggaaaagctc tcaggagacc tcacctagat gcctattcaa gctttggaca gccatcagat 180 

tgtcagccaa gagcctttta tttgaaagct cattcttccc cagacttgga ctctgggtca 24 0 

gaggaagatg ggaaagaaag gacagatttt caggaagaaa atcacatttg tacctttaaa 300 

cagactttag aaaactacag gactccaaat tttcagtctt atgacttgga cacatagact 360 

gaatgagacc aaaggaaaag cttaacatac tacctcaagg tgaactttta tttaaaagag 420 

agagaatctt atgtttttta aatggagtta tgaattttaa 460 

<210> 206 
<211> 481 
<212> DNA 
<213> Homo sapiens 

<400> 206 

tgtggtggaa ttcgggacgc ccccagaccc tgactttttc ctgcgtgggc cgtctcctcc 60 
tgcggaagca gtgacctctg acccctggtg accttcgctt tgagtgcctt ttgaacgctg 120 
gtcccgcggg acttggtttt ctcaagctct gtctgtccaa agacgctccg gtcgaggtcc 180 
cgcctgccct gggtggatac ttgaacccca gacgcccctc tgtgctgctg tgtccggagg 240 
cggccttccc atctgcctgc ccacccggag ctctttccgc cggcgcaggg tcccaagccc 300 
acctcccgcc ctcagtcctg cggtgtgcgt ctgggcacgt cctgcacaca caatgcaagt 360 
cctggcctcc gcgcccgccc gcccacgcga gccgtacccg ccgccaactc tgttatttat 420 
ggtgtgaccc cctggaggtg ccctcggccc accggggcta tttattgttt aatttatttg 480 
t 481 



<210> 207 
<211> 605 
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<221> misc_feature 
<222> 20 f 21 f 61 
<223> n = A, T, C or G 

<400> 210 

cgccttgggg agccggcggn ngagtccggg acgtggagac ccggggtccc ggcagccggg 60 

nggcccgcgg gcccagggtg gggatgcacc gccgcggggt gggagctggc gccatcgcca 120 

agaagaaact tgcagaggcc aagtataagg agcgagggac ggtcttggct gaggaccagc 180 

tagcccagat gtcaaagcag ttggacatgt tcaagaccaa cctggaggaa tttgccagca 240 

aacacaagca ggagatccgg aagaatcctg agttccgtgt gcagttccag gacatgtgtg 300 

caaccattgg cgtggatccg ctggcctctg gaaaaggatt ttggtctgag atgctgggcg 360 

tgggggactt ctattacgaa ctaggtgtcc aaattatcga agtgtgcctg gcgctgaagc 420 

atcggaatgg aggtctgata actttggagg aactacatca acaggtgttg aagggaaggg 480 

gcaagttcgc ccaggatgtc agtcaagatg acctgatcag agccatcaag aaa 533 

<210> 211 
<211> 451 
<212> DNA 
<213> Homo sapiens 

<400> 211 

ttagcttgag ccgagaacga ggcgagaaag ctggagaccg aggagaccgc ctagagcgga 60 

gtgaacgggg aggggaccgt ggggaccggc ttgatcgtgc gcggacacct gctaccaagc 120 

ggagcttcag caaggaagtg gaggagcgga gtagagaacg gccctcccag cctgaggggc 180 

tgcgcaaggc agctagcctc acggaggatc gggaccgtgg gcgggatgcc gtgaagcgag 240 

aagctgccct acccccagtg agccccctga aggcggctct ctctgaggag gagttagaga 300 

agaaatccaa ggctatcatt gaggaatatc tccatctcaa tgacatgaaa gaggcagtcc 360 

agtgcgtgca ggagctggcc tcaccctcct tgctcttcat ctttgtacgg catggtgtcg 420 

agtctacgct ggagcgcagt gccattgctc g 451 

<210> 212 
<211> 471 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 
<222> 54 

<223> n m A,T,C or G 
<400> 212 

gtgattattc ttgatcaggg agaagatcat ttagatttgt tttgcattcc ttanaatgga 60 
gggcaacatt ccacagctgc cctggctgtg atgagtgtcc ttgcaggggc cggagtagga 120 
gcactggggt gggggcggaa ttggggttac tcgatgtaag ggattccttg ttgttgtgtt 180 
gagatccagt gcagttgtga tttctgtgga tcccagcttg gttccaggaa ttttgtgtga 240 
ttggcttaaa tccagttttc aatcttcgac agctgggctg gaacgtgaac tcagtagctg 300 
aacctgtctg acccggtcac gttcttggat cctcagaact ctttgctctt gtcggggtgg 360 
gggtgggaac tcacgtgggg agcggtggct gagaaaatgt aaggattctg gaatacatat 420 
tccatgggac tttccttccc tctcctgctt cctcttttcc tgctccctaa c 471 

<210> 213 
<211> 511 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 27, 63, 337, 442 
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<212> DNA 

<213> Homo sapiens 
<400> 207 

accctttttg gattcagggc tcctcacaat 
tatagaagca tccctttgta tactgttttg 
ctcactggat tctcacggta ggatttctga 
ttttttgatc ctagggtgct ccttttgttt 
ggtggcagaa ttggcaccat tacccaggtc 
tgtatcatga aatgatttga aatcattgta 
tttccttgtg ctttgataac aaagactcca 
aagggctaga ttgggatttg aagacaaaat 
aacattaatg aaagcaaaac attataaaag 
tcttgatgct tccaaatgac atctaccaga 
cataa 

<210> 208 
<211> 655 
<212> DNA 
<213> Homo sapiens 



taaaatgagt gtaatgaaac aaggtgaaaa 60 
ctacttacag tgtacttggc attgctttat 120 
gatcttaatc taagctccaa agttgtctac 180 
tacagagcag ggtcacttga tttgctagct 240 
tgactgacca ccagtcagag gcactttatt 300 
aagcagcgaa gtctgataat gaatgccagc 360 
aatattctgg agaacctgga taaaagtttg 420 
tgtaggaaat cttacatttt tgcaataaca 480 
taattttaat tcaccacata cttatcaatt 540 
tatggttttg tggacatctt tttctgttta 600 

605 



<400> 208 

ggcgttgttc tggattcccg tcgtaactta aagggaaact ttcacaatgt ccggagccct 60 
tgatgtcctg caaatgaagg aggaggatgt ccttaagttc cttgcagcag gaacccactt 120 
aggtggcacc aatcttgact tccagatgga acagtacatc tataaaagga aaagtgatgg 180 
catctatatc ataaatctca agaggacctg ggagaagctt ctgctggcag ctcgtgcaat 240 
tgttgccatt gaaaaccctg ctgatgtcag tgttatatcc tccaggaata ctggccagag 300 
ggctgtgctg aagtttgctg ctgccactgg agccactcca attgctggcc gcttcactcc 360 
tggaaccttc actaaccaga tccaggcagc cttccgggag ccacggcttc ttgtggttac 420 
tgaccccagg gctgaccacc agcctctcac ggaggcatct tatgttaacc tacctaccat 480 
tgcgctgtgt aacacagatt ctcctctgcg ctatgtggac attgccatcc catgcaacaa 54 0 
caagggagct cactcagtgg gtttgatgtg gtggatgctg gctcgggaag ttctgcgcat 600 
gcgtggcacc atttcccgtg aacacccatg ggaggtcatg cctgatctgt acttc 655 



<210> 209 
<211> 621 
<212> DNA 
<213> Homo sapiens 

<400> 209 

catttagaac atggttatca tccaagacta 
caaatccaca ttcctcttga gttctgcagc 
gccgtagaat cacatgatct gaggaccatt 
gagtcttcca taaagttttg catggagcaa 
tcagccctct aaaagcatag ggcttagcct 
tagttttgta aacactatag catctgttaa 
gccgtgactc tggactatat cagtttttgg 
ccacgtggac cagtctgaat gtctttcctt 
aagaaacaat ctaaacaagt ttctgttgca 
gtaggcttct atattgcatt taacttgttt 
ctattgatga ataaagaaat t 



ctctaccctg caacattgaa ctcccaagag 60 
ttctgtgtaa atagggcagc tgtcgtctat 120 
catggaagct gctaaatagc ctagtctggg 180 
acaaacagga ttaaactagg tttggttcct 24 0 
gcaggcttcc ttgggctttc tctgtgtgtg 300 
gatccagtgt ccatggaaac cttcccacat 360 
aaagcagggt tcctctgcct gctaacaagc 420 
tacacctatg tttttaaata gtcaaacttc 480 
tatgtgtttg tgaacttgta tttgtattta 540 
ttgtaactcc tgattcttcc ttttcggata 600 

621 



<210> 210 
<211> 533 
<212> DNA 
<213> Homo sapiens 



<220> 
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<223> n = A,T,C or G 
<400> 213 

ctaattagaa acttgctgta ctttttnttt tcttttaggg gtcaaggacc ctctttatag 60 

ctnccatttg cctacaataa attattgcag cagtttgcaa tactaaaata ttttttatag 120 

actttatatt tttccttttg ataaagggat gctgcatagt agagttggtg taattaaact 180 

atctcagccg tttccctgct ttcccttctg ctccatatgc ctcattgtcc ttccagggag 240 

ctcttttaat cttaaagttc tacatttcat gctcttagtc aaattctgtt acctttttaa 300 

taactcttcc cactgcatat ttccatcttg aattggnggt tctaaattct gaaactgtag 360 

ttgagataca gctatttaat atttctggga gatgtgcatc cctcttcttt gtggttgccc 420 

aaggttgttt tgcgtaactg anactccttg atatgcttca gagaatttag gcaaacactg 480 
gccatggccg tgggagtact gggagtaaaa t 511 

<210> 214 

<211> 521 

<212> DNA 

<213> Homo sapiens 

<400> 214 

agcattgcca aataatccct aattttccac taaaaatata atgaaatgat gttaagcttt 60 

ttgaaaagtt taggttaaac ctactgttgt tagattaatg tatttgttgc ttccctttat 120 

ctggaatgtg gcattagctt ttttatttta accctcttta attcttattc aattccatga 180 

cttaaggttg gagagctaaa cactgggatt tttggataac agactgacag ttttgcataa 240 

ttataatcgg cattgtacat agaaaggata tggctacctt ttgttaaatc tgcactttct 300 

aaatatcaaa aaagggaaat gaagtataaa tcaatttttg tataatctgt ttgaaacatg 360 

agttttattt gcttaatatt agggctttgc cccttttctg taagtctctt gggatcctgt 420 

gtagaagctg ttctcattaa acaccaaaca gttaagtcca ttctctggta ctagctacaa 480 

attcggtttc atattctact taacaattta aataaactga a 521 

<210> 215 
<211> 381 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 17, 20, 60, 61, 365 

<223> n = A,T,C or G 

<400> 215 

gagcggagag cggaccngtn agagccctga gcagccccac cgccgccgcc ggcctagttn 60 
ncatcacacc ccgggaggag ccgcagctgc cgcagccggc cccagtcacc atcaccgcaa 120 
ccatgagcag cgaggccgag acccagcagc cgcccgccgc cccccccgcc gcccccgccc 180 
tcagcgccgc cgacaccaag cccggcacta cgggcagcgg cgcagggagc ggtggcccgg 240 
gcggcctcac atcggcggcg cctgccggcg gggacaagaa ggtcatcgca acgaaggttt 300 
tgggaacagt aaaatggttc aatgtaagga acggatatgg tttcatcaac aggaatgaca 360 
ccaangaaga tgtatttgta c 381 

<210> 216 
<211> 425 
<212> DNA 
<213> Homo sapiens 

<400> 216 

ttactaacta ggtcattcaa ggaagtcaag ttaacttaaa catgtcacct aaatgcactt 60 

gatggtgttg aaatgtccac cttcttaaat ttttaagatg aacttagttc taaagaagat 120 

aacaggccaa tcctgaaggt actccctgtt tgctgcagaa tgtcagatat tttggatgtt 180 

gcataagagt cctatttgcc ccagttaatt caacttttgt ctgcctgttt tgtggactgg 240 
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ctggctctgt tagaactctg tccaaaaagt gcatggaata taacttgtaa agcttcccac 300 
aattgacaat atatatgcat gtgtttaaac caaatccaga aagcttaaac aatagagctg 360 
cataatagta tttattaaag aatcacaact gtaaacatga gaataactta aggattctag 420 
tttag ~ " 425 

<210>- 217 
<211> 181 
<212> DNA 
<213> Homo sapiens 

<400> 217 

gagaaaccaa atgataggtt gtagagcctg atgactccaa acaaagccat cacccgcatt 60 

cttcctcctt cttctggtgc tacagctcca agggcccttc accttcatgt ctgaaatgga 120 

actttggctt tttcagtgga agaatatgtt gaaggtttca ttttgttcta gaaaaaaaaa 180 

a 181 

<210> 218 
<211> 405 
<212> DNA 
<213> Homo sapiens 



<400> 218 

caggccttcc agttcactga caaacatggg 
agtgatacca tcaagcctga tgtccaaaag 
gcgctgggct gttttagtgc caggctgcgg 
tatttttttt ttccattagt aaaacacaag 
acaaggcagg cctttcctac agggggtgga 
ggcctgagtt ggcgttgtgg gcaggctact 
attaatcttt tgtagtttgt attaaacttg 

<210> 219 
<211> 216 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 207, 210 
<223> n « A,T,C or G 



gaagtgtgcc cagctggctg gaaacctggc 60 
agcaaagaat atttctccaa gcagaagtga 120 
tgggcagcca tgagaacaaa acctcttctg 180 
acttcagatt cagccgaatt gtggtgtctt 240 
gagaccagcc tttcttcctt tggtaggaat 300 
ggtttgtatg atgtattagt agagcaaccc 360 
aactgagaaa aaaaa 4 05 



<400> 219 

actccaagag ttagggcagc agagtggagc 
ttaatttacc atgtaaaatt gctgtaaatg 
tcaattgtaa acttcttgtt aagactgtta 
aaaaataaaa aggaaagaac cctcttnaan 



gatttagaaa gaacatttta aaacaatcag 60 
ataatgtgta cagattttct gttcaaatat 120 
cgtttctatt gcttttgtat gggatattgc 180 
aaaaaa 216 



<210> 220 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 220 

cttacaaatt gcccccatgt gtaggggaca 
tctgtacaaa gtctttgcct ttttccttct 
atttcatctt tgagggaaac tgattagatg 
gcaccccaag gactcagaag atgattttaa 
gcatgtaata atgttgagtg gcagtcaaaa 
tgcattgaaa aggaaaacct gtctgagaaa 



cagaaccctt tgagaaaact tagatttttg 60 
tcattttttt ccagtacatt aaatttgtca 120 
ggttgtgttt gtgttctgat ggagaaaaca 180 
cagttcagaa cagatgtgtg caatattggt 24 0 
gtcatgattt ttatcttagt tcttcattac 300 
atgcctgaca gtttaattta aaactatggt 360 
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gtaagtcttt gacaaaaaaa 38 0 

<210> 221 

<211> 398 

<212> DNA 

<213> Homo sapiens 

<400> 221 

ggttagtaag ctgtcgactt tgtaaaaaag ttaaaaatga aaaaaaaagg aaaaatgaat 60 

tgtatattta atgaatgaac atgtacaatt tgccactggg aggaggttcc tttttgttgg 120 

gtgagtctgc aagtgaattt cactgatgtt gatattcatt gtgtgtagtt ttatttcggt 180 

cccagccccg tttcctttta ttttggagct aatgccagct gcgtgtctag ttttgagtgc 240 

agtaaaatag aatcagcaaa tcactcttat ttttcatcct tttccggtat tttttgggtt 300 

gtttctgtgg gagcagtgta caccaactct tcctgtatat tgcctttttg ctggaaaatg 360 

ttgtatgttg aataaaattt tctataaaaa ttaaaaaa 398 

<210> 222 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 49, 64 

<223> n = A,T,C or G 

<400> 222 

ttcgataatt gatctcatgg gctttccctg gaggaaaggt tttttttgnt gtttattttt 60 

taanaacttg aaacttgtaa actgagatgt ctgtagcttt tttgcccatc tgtagtgtat 120 

gtgaagattt caaaacctga gagcactttt tctttgttta gaattatgag aaaggcacta 18 0 

gatgacttta ggatttgcat ttttcccttt attgcctcat ttcttgtgac gccttgttgg 240 

ggagggaaat ctgtttattt tttcctacaa ataaaaagct aagattctat atcgcaaaaa 300 
a 301 

<210> 223 
<211> 200 
<212> DNA 
<213> Homo sapiens 

<400> 223 

gtaagtgctt aggaagaaac tttgcaaaca tttaatgagg atacactgtt catttttaaa 60 

attccttcac actgtaattt aatgtgtttt atattctttt gtagtaaaac aacataactc 120 

agatttctac aggagacagt ggttttattt ggattgtctt ctgtaatagg tttcaataaa 18 0 
gctggatgaa cttaaaaaaa 200 

<210> 224 
<211> 385 
<212> DNA 
<213> Homo sapiens 

<400> 224 

gaaaggtttg atccggactc aaagaaagca aaggagtgtg agccgccatc tgctggagca 60 
gctgtaactg caagacctgg acaagagatt cgtcagcgaa ctgcagctca aagaaacctt 120 
tctccaacac cagcaagccc taaccagggc cctcctccac aagttccagt atctcctgga 180 
ccaccaaagg acagttctgc ccctggtgga cccccagaaa ggactgttac tccagcccta 24 0 
tcatcaaatg tgttaccaag acatcttgga tcccctgcta cttcagtgcc tggaatgggt 300 
aaacagagca cttaatgtta tttacagttt atattgtttt ctctggttac caataaaacg 360 
ggccattttc aggtggtaaa aaaaa -385 
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<210> 225 
<211> 560 
<212> PRT 

<213>^ Homo sapiens 



<400> 225 
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405 



Ser 


Asp 


Pro Thr Cys Glu He Thr 






420 


Asp 


Val 


Asp Glu Met Cys Leu Leu 






435 440 


Ser 


Gly 


Thr Tyr Cys Val Asn Leu 




450 
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Ala 


Leu 


Thr Ser Thr Leu He Ser 


465 




470 


Pro 


Leu 


Arg Met Ala Asn Ser Ala 






485 


He 


Phe 


Val Thr Val He Ser Leu 






500 


Tyr 


Asn 


Pro He Glu Asn Ser Pro 






515 520 


Leu 


Ser 


Val Phe Leu Asn Arg Ala 




530 


535 


Gin 


Glu 


Lys Asp Pro Leu Leu Lys 


545 




550 



410 415 
Gin Asn Thr Val Cys Ser Pro Val" 
425 430 
Thr Val Arg Arg Thr Phe Asn Gly 
445 

Thr Leu Gly Asp Asp Thr Ser Leu 
460 

Val Pro Asp Arg Asp Pro Ala Ser 
475 480 
Leu He Ser Val Gly Cys Leu Ala 

490 495 
Leu Val Tyr Lys Lys His Lys Glu 
505 " 510 

Gly Asn Val Val Arg Ser Lys Gly 
525 

Lys Ala Val Phe Phe Pro Gly Asn 
540 

Asn Gin Glu Phe Lys Gly Val Ser 
555 560 



<210> 226 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 226 

He Leu He Pro Ala Thr Trp Lys Ala 

1 5 . 



<210> 227 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 227 

Phe Leu Leu Asn Asp Asn Leu Thr Ala 
1 5 



<210> 228 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 228 

Leu Leu Gly Asn Cys Leu Pro Thr Val 
1 5 



<210> 229 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 229 

Lys Leu Leu Gly Asn Cys Leu Pro Thr Val 
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10 



<210> 230 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 230 

Arg Leu Thr Gly Gly Leu Lys Phe Phe Val 
15 10 



<210> 231 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 231 

Ser Leu Gin Ala Leu Lys Val Thr Val 
1 5 



<210> 232 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 232 

Ala Gly Ala Asp Val lie Lys Asn Asp Gly He Tyr Ser Arg Tyr Phe 

15 10 15 

Phe Ser Phe Ala 
20 



<210> 233 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 233 

Phe Phe Ser Phe Ala Ala Asn Gly Arg Tyr Ser Leu Lys Val His Val 

15 10 15 

Asn His Ser Pro Ser 
20 



<210> 234 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 234 

Phe Leu Val Thr Trp Gin Ala Ser Gly Pro Pro Glu lie He Leu Phe 

1 5 .10 15 

Asp Pro Asp Gly 
20 
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<210> 235 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 235 

Leu Gin Ser Ala Val Ser Asn lie 

1 5 
Pro Asn Ser Asp 
20 



Ala Gin Ala Pro Leu Phe lie Pro 
10 15 



<210> 236 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 236 

lie Gin Asp Asp Phe Asn Asn Ala He Leu Val Asn Thr Ser Lys Arg 

15 10 15 

Asn Pro Gin Gin' 
20 



<210> 237 
<211> 21 
<212> PRT 

<213> Homo sapiens 



<400> 237 

Arg Asn Ser Leu Gin Ser Ala Val Ser Asn He Ala Gin Ala Pro Leu 

15 10 15 

Phe He Pro Pro Asn 
20 



<210> 238 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 238 

Thr His Glu Ser His Arg He Tyr Val Ala He Arg Ala Met Asp Arg 

15 10 15 

Asn Ser Leu Gin 
20 



<210> 239 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 239 

Arg Asn Pro Gin Gin Ala Gly He Arg Glu He Phe Thr Phe Ser Pro 

15 10 15 

Gin He Ser Thr 
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20 



<210> 240 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 240 

Gly Gin Ala Thr Ser Tyr Glu He Arg Met Ser Lys Ser Leu Gin Asn 

1 5 10 15 

He Gin Asp Asp Phe 
20 



<210> 241 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 241 

Glu Arg Lys Trp Gly Phe Ser Arg Val Ser Ser Gly Gly Ser Phe Ser 

1 5 10 15 

Val Leu Gly Val 
20 



<210> 242 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 242 

Gly Ser His Ala Met Tyr Val Pro Gly Tyr Thr Ala Asn Gly Asn He 

1 5 10 15 

Gin Met Asn Ala 
20 



<210> 243 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 243 

Val Asn His Ser Pro Ser He Ser Thr Pro Ala His Ser He Pro Gly 

15 10 15 

Ser His Ala Met 
20 



<210> 244 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 244 

Ala Val Pro Pro Ala Thr Val Glu Ala Phe Val Glu Arg Asp Ser Leu 
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10 



15 



His Phe Pro His 
20 



<210> 245 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 245 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

15 10 15 

Gin Ala Leu Lys 
20 



<210> 246 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 246 

Asn Leu Thr Phe Arg Thr Ala Ser Leu Trp lie Pro Gly Thr Ala Lys 

1 5 10 15 

Pro Gly His Trp 
20 



<210> 247 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 247 

Leu His Phe Pro His Pro Val Met lie Tyr Ala Asn Val Lys Gin Gly 

15 10 15 

Phe Tyr Pro He 
20 



<210> 248 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 248 

Pro Glu Thr Gly Asp Pro Val Thr Leu Arg Leu Leu Asp Asp Gly Ala 

15 10 15 

Gly Ala Asp Val 
20 



<210> 249 

<211> 20 

<212> PRT 

<213> Homo sapiens 
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<400> 249 

Gly Phe Tyr Pro lie Leu Asn Ala Thr Val Thr Ala Thr Val Glu Pro 

1 5 10 15 

Glu Thr Gly Asp 
20 



<210> 250 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 250 

Phe Asp Pro Asp Gly Arg Lys Tyr Tyr Thr Asn Asn Phe He Thr Asn 

1 5 10 15 

Leu Thr Phe Arg 
20 



<210> 251 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 251 

Leu Gin Ala Leu Lys Val Thr Val Thr Ser Arg Ala Ser Asn Ser Ala 

15 10 15 

Val Pro Pro Ala 
20 



<210> 252 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<400> 252 

Met Ala Ser Val Arg Val Ala Ala Tyr Phe Glu Asn Phe Leu Ala Ala 

1 5 10 15 

Trp Arg Pro Val Lys Ala Ser Asp Gly Asp Tyr Tyr Thr Leu Ala Val 

20 - 25 30 

Pro Met Gly Asp Val Pro Met Asp Gly He Ser Val Ala Asp He Gly 

35 40 45 

Ala Ala Val Ser Ser He Phe Asn Ser Pro Glu Glu Phe Leu Gly Lys 

50 55 , 60 

Ala Val Gly Leu Ser Ala Glu Ala Leu Thr He Gin Gin Tyr Ala Asp 
65 70 75 80 

Val Leu Ser Lys Ala Leu Gly Lys Glu Val Arg Asp Ala Lys lie Thr 

85 90 95 

Pro Glu Ala Phe Glu Lys Leu Gly Phe Pro Ala Ala Lys Glu He Ala 

100 105 110 

Asn Met Cys Arg Phe Tyr Glu Met Lys Pro Asp Arg Asp Val Asn Leu 

115 120 125 

Thr His Gin Leu Asn Pro Lys Val Lys Ser Phe Ser Gin Phe He Ser 

130. 135 140 

Glu Asn Gin Gly Ala Phe Lys Gly Met 
145 150 
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<210> 253 
<211> 462 
<212> DNA 
<213> Homo sapiens 

<400> 253 

atggccagtg tccgcgtggc ggcctacttt gaaaactttc tcgcggcgtg gcggcccgtg 60 

aaagcctctg atggagatta ctacaccttg gctgtaccga tgggagatgt accaatggat 120 

ggtatctctg ttgctgatat tggagcagcc gtctctagca tttttaattc tccagaggaa 180 

tttttaggca aggccgtggg gctcagtgca gaagcactaa caatacagca atatgctgat 240 

gttttgtcca aggctttggg gaaagaagtc cgagatgcaa agattacccc ggaagctttc 300 

gagaagctgg gattccctgc agcaaaggaa atagccaata tgtgtcgttt ctatgaaatg 360 

aagccagacc gagatgtcaa tctcacccac caactaaatc ccaaagtcaa aagcttcagc 420 

cagtttatct cagagaacca gggagccttc aagggcatgt ag 4 62 

<210> 254 

<211> 8031 

<212> DNA 

<213> Homo sapiens 

<400> 254 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600 
tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 
actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720 
gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 
aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840 
agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900 
cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 
aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 
tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 
tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140 
taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 
ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 12 60 
tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 
tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 
cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500 
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620 
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680 
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800 
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860 
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920 
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980 
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 204 0 
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100 
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 
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tcccctgatt ctgtggataa ccgtattacc 
agccgaacga ccgagcgcag cgagtcagtg 
tattttctcc ttacgcatct gtgcggtatt 
caatctgctc tgatgccgca tagttaagcc 
ggtcatggct gcgccccgac acccgccaac 
gctcccggca tccgcttaca gacaagctgt 
gttttcaccg tcatcaccga aacgcgcgag 
gtgaagcgat tcacagatgt ctgcctgttc 
aagcgttaat gtctggcttc tgataaagcg 
ggtcactgat gcctccgtgt aagggggatt 
acgagagagg atgctcacga tacgggttac 
ttgtgagggt aaacaactgg cggtatggat 
tcaatgccag cgcttcgtta atacagatgt 
tgcgatgcag atccggaaca taatggtgca 
cgaaacacgg aaaccgaaga ccattcatgt 
gcagtcgctt cacgttcgct cgcgtatcgg 
ccgccagcct agccgggtcc tcaacgacag 
catgccggcg ataatggcct gcttctcgcc 
ggcttgagcg agggcgtgca agattccgaa 
gctccagcga aagcggtcct cgccgaaaat 
gagttgcatg ataaagaaga cagtcataag 
ccggaaggag ctgactgggt tgaaggctct 
atgagtgagc taacttacat taattgcgtt 
cctgtcgtgc cagctgcatt aatgaatcgg 
tgggcgccag ggtggttttt cttttcacca 
ccgcctggcc ctgagagagt tgcagcaagc 
aatcctgttt gatggtggtt aacggcggga 
atcccactac cgagatatcc gcaccaacgc 
cgcccagcgc catctgatcg ttggcaacca 
gcatttgcat ggtttgttga aaaccggaca 
tcggctgaat ttgattgcga gtgagatatt 
agacagaact taatgggccc gctaacagcg 
gctccacgcc cagtcgcgta ccgtcttcat 
ggtcagagac atcaagaaat aacgccggaa 
catcctggtc atccagcgga tagttaatga 
tgtgcaccgc cgctttacag gcttcgacgc 
tggcacccag ttgatcggcg cgagatttaa 
gggccagact ggaggtggca acgccaatca 
ccacgcggtt gggaatgtaa ttcagctccg 
tcgcagaaac gtggctggcc tggttcacca 
catactctgc gacatcgtat aacgttactg 
cttccgggcg ctatcatgcc ataccgcgaa 
tctcgacgct ctcccttatg cgactcctgc 
ccgttgagca ccgccgccgc aaggaatggt 
ccggccacgg ggcctgccac catacccacg 
cgagcccgat cttccccatc ggtgatgtcg 
gcgccggtga tgccggccac gatgcgtccg 
aattaatacg actcactata ggggaattgt 
ttttgtttaa ctttaagaag gagatataca 
acagcttcaa gacaatgggt ataatggatt 
gaatcagaac ctcatctcaa acattaagga 
taatgctacc aagagaagag tatttttcag 
gaaagctaat aataacagca aaataaaaca 
gactgactgg tatggggcac atggagatga 
aaaagaggga aaatacattc atttcacacc 
tggctacgga tcacgaggcc gagtgtttgt 
gttcgatgag tataacaatg acaaaccttt 
gacaaggtgt tcatctgaca tcacaggcat 



gcctttgagt gagctgatac cgctcgccgc 2220 
agcgaggaag cggaagagcg cctgatgcgg 2280 
tcacaccgca tatatggtgc actctcagta 2340 
agtatacact ccgctatcgc tacgtgactg 2400 
acccgctgac gcgccctgac gggcttgtct 2460 
gaccgtctcc gggagctgca tgtgtcagag 2520 
gcagctgcgg taaagctcat cagcgtggtc 2580 
atccgcgtcc agctcgttga gtttctccag 2640 
ggccatgtta agggcggttt tttcctgttt 2700 
tctgttcatg ggggtaatga taccgatgaa 27 60 
tgatgatgaa catgcccggt tactggaacg 2820 
gcggcgggac cagagaaaaa tcactcaggg 2880 
aggtgttcca cagggtagcc agcagcatcc 2940 
gggcgctgac ttccgcgttt ccagacttta 3000 
tgttgctcag gtcgcagacg ttttgcagca 3060 
tgattcattc tgctaaccag taaggcaacc 3120 
gagcacgatc atgcgcaccc gtggggccgc 3180 
gaaacgtttg gtggcgggac cagtgacgaa 3240 
taccgcaagc gacaggccga tcatcgtcgc 3300 
gacccagagc gctgccggca cctgtcctac 3360 
tgcggcgacg atagtcatgc cccgcgccca 3420 
caagggcatc ggtcgagatc ccggtgccta 3480 
gcgctcactg cccgctttcc agtcgggaaa 3540 
ccaacgcgcg gggagaggcg gtttgcgtat 3600 
gtgagacggg caacagctga ttgcccttca 3660 
ggtccacgct ggtttgcccc agcaggcgaa 3720 
tataacatga gctgtcttcg gtatcgtcgt 3780 
gcagcccgga ctcggtaatg gcgcgcattg 3840 
gcatcgcagt gggaacgatg ccctcattca 3900 
tggcactcca gtcgccttcc cgttccgcta 3960 
tatgccagcc agccagacgc agacgcgccg 4020 
cgatttgctg gtgacccaat gcgaccagat 4080 
gggagaaaat aatactgttg atgggtgtct 4140 
cattagtgca ggcagcttcc acagcaatgg 4200 
tcagcccact gacgcgttgc gcgagaagat 4260. 
cgcttcgttc taccatcgac acca.ccacgc 4320 
tcgccgcgac aatttgcgac ggcgcgtgca 4380 
gcaacgactg tttgcccgcc agttgttgtg 4440 
ccatcgccgc ttccactttt tcccgcgttt 4500 
cgcgggaaac ggtctgataa gagacaccgg 4560 
gtttcacatt caccaccctg aattgactct 4620 
aggttttgcg ccattcgatg gtgtccggga 4680 
attaggaagc agcccagtag taggttgagg 4740 
gcatgcaagg agatggcgcc caacagtccc 4800 
ccgaaacaag cgctcatgag cccgaagtgg 4860 
gcgatatagg cgccagcaac cgcacctgtg 4920 
gcgtagagga tcgagatctc gatcccgcga 4980 
gagcggataa caattcccct ctagaaataa 5040 
tatgcagcat caccaccatc accacggagt 5100 
gctcattgca attaatcctc aggtacctga 5160 
aatgataact gaagcttcat tttacctatt 5220 
aaatataaag attttaatac ctgccacatg 5280 
agaatcatat gaaaaggcaa atgtcatagt 5340 
tccatacacc ctacaataca gagggtgtgg 5400 
taatttccta ctgaatgata acttaacagc 5460 
ccatgaatgg gcccacctcc gttggggtgt 5520 
ctacataaat gggcaaaatc aaattaaagt 5580 
ttttgtgtgt gaaaaaggtc cttgccccca 5 640 
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agaaaactgt 
ccaaaatgca 
taatgcaagt 
aagtgcatgg 
gactgagctt 
tttagtgctg 
agccgcagaa 
tttcgacagc 
aaagttgctg 
ttcagggctt 
tgtgatgata 
gctcagcagt 
ggaggaatta 
caatagcatg 
acatattcag 
agtgactgtg 
tggtcctcct 
tatcaccaat 
gcactggact 
gacctctcgc 
agacagcctc 
tcccattctt 
gctgagactc 
gaggtatttt 
ctctcccagc 
aggttacaca 
tgaggaggag 
gggagttcca 
agctgtaaaa 
tcagggccag 
tgactttaac 
cagggagata 
tggagaaaca 
cttacagtct 
tcctgtacct 
aggaatcatt 
agacaagaaa 
ggcggccgct 
aggaagctga 
ctaaacgggt 



attattagta 
actgcatcaa 
acccacaacc 
gatgtaatca 
ccacctcctc 
gatgtgtcca 
ttttatttga 
aaaggagaga 
gtttcatatc 
aagaaaggat 
ttagtgacca 
ggttcaacaa 
tcacgtctta 
attgatgctt 
cttgaaagta 
gataatactg 
gagattatat 
ctaacttttc 
tacaccctga 
gcctccaact 
cattttcctc 
aatgccactg 
cttgatgatg 
ttctcctttg 
ataagcaccc 
gcaaacggta 
cgaaagtggg 
gctggccccc 
gtagaagagg 
gctacaagct 
aatgctattt 
tttacgttct 
catgaaagcc 
gctgtatcta 
gccagagatt 
tgccttatta 
gagaatggaa 
cgagcaccac 
gttggctgct 
cttgaggggt 



agctttttaa 
taatgttcat 
aagaagcacc 
cagactctgc 
ccacattctc 
gcaagatggc 
tgcagattgt 
tcagagccca 
tgcccaccac 
ttgaggtggt 
gcggagatga 
ttcactccat 
caggaggttt 
tcagtagaat 
caggtgaaaa 
tgggcaacga 
tatttgatcc 
ggacagctag 
acaataccca 
cagctgtgcc 
atcctgtgat 
tcactgccac 
gagcaggtgc 
ctgcaaatgg 
cagcccactc 
atattcagat 
gctttagccg 
accctgatgt 
aattgaccct 
atgaaataag 
tagtaaatac 
caccccaaat 
acagaattta 
acattgccca 
atcttatatt 
tagttgtgac 
caaaattatt 
caccaccacc 
gccaccgctg 
tttttgctga 



agaaggatgc 
gcaaagttta 
aaacctacag 
tgactttcac 
gcttgtagag 
agaggctgac 
tgaaattcat 
gctacaccaa 
tgtatcagct 
tgaaaaactg 
taagcttctt 
tgccctgggt 
aaagttcttt 
ttcctctgga 
tgtcaaacct 
cactatgttt 
tgatggacga 
tctttggatt 
tcattctctg 
cccagccact 
gatttatgcc 
agttgagcca 
tgatgttata 
tagatatagc 
tattccaggg 
gaatgctcca 
agtcagctca 
gtttccacca 
atcttggaca 
aatgagtaaa 
atcaaagcga 
ttccacgaat 
tgttgcaata 
ggcgcctctg 
gaaaggagtt 
acatcatact 
ataatgaatt 
actgagatcc 
agcaataact 
aaggaggaac 



acctttatct 
tcttctgtgg 
aaccagatgt 
cacagctttc 
gctggtgaca 
agactccttc 
accttcgtgg 
attaacagca 
aaaacagaca 
aatggaaaag 
ggcaattgct 
tcatctgcag 
gttccagata 
actggagaca 
caccatcaat 
ctagttacgt 
aaatactaca 
ccaggaacag 
caagccctga 
'gtggaagcct 
aatgtgaaac 
gagactggag 
aaaaatgatg 
ttgaaagtgc 
agtcatgcta 
aggaaatcag 
ggaggctcct 
tgcaaaatta 
gcacctggag 
agtctacaga 
aatcctcagc 
ggacctgaac 
cgagcaatgg 
tttattcccc 
ttaacagcaa 
ttaagcagga 
ctgcagatat 
ggctgctaac 
agcataaccc 
tatatccgga 



acaatagcac 
ttgaattttg 
gcagcctcag 
ccatgaacgg 
aagtggtctg 
aactacaaca 
gcattgccag 
atgatgatcg 
tcagcatttg 
cttatggctc 
tacccactgt 
ccccaaatct 
tatcaaactc 
ttttccagca 
tgaaaaacac 
ggcaggccag 
caaataattt 
ctaagcctgg 
aagtgacagt 
ttgtggaaag 
agggatttta 
atcctgttac 
gaatttactc 
atgtcaatca 
tgtatgtacc 
taggcagaaa 
tttcagtgct 
ttgacctgga 
aagactttga 
atatccaaga 
aagctggcat 
atcagccaaa 
ataggaactc 
ccaattctga 
tgggtttgat. 
aaaagagagc 
ccatcacact 
aaagcccgaa 
cttggggcct 
t 



.5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8031 



<210> 255 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 9, 67, 247, 275, 277, 397 

<223> n = A, T,C or G 

<400> 255 

gtggccagng actagaaggc gaggcgccgc 
agtccanagg acggagaaga cgaggaagag 
ggaattattg attcagactt cctctcaaaa 
gacactgaga ggcccattct gcaagtggac 



gggaccatgg cggcggcggc ggacgagcgg 60 
gaggagcagt tggttctggt ggaattatca 120 
tgtgaaaata aatgcaaggt tttgggcatt 180 
agctgtgtct ttgctgggga gtatgaagac 240 
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actctangga cctgtgttat atttgaagaa aatgntnaac atgctgatac agaaggcaat 300 
aataaaacag tgctaaaata taaatgccat acaatgaaga agctcagcat gacaagaact 360 
ctcctgacag agaagaagga aggagaagaa aacatangtg g 401 

<210> 256 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 7, 37, 51, 79, 96, 98, 103, 104, 107, 116, 167, 181, 183, 
194, 206, 276, 303, 307, 308, 310, 323, 332, 341, 353, 374, 
376 

<223> n = A,T,C or G 



<400> 256 

tggtggncct 

gggccggggt 

ggccgaggat 

nancaagccc 

catttgattt 

canaaannan 

tttaggcctc 



gggatgggga 
cgcggccgng 
aaggagtgga 
ctgnaggaga 
cttcctgggg 
accccgttgc 
attncngggg 



accgcggtgg 
gacggggccg 
tgcccgtcac 
tctatntctt 
gcctctctca 
ccngtccatc 
ggaaccttaa 



cttccgngga 
gggccnangc 
caacttgggc 
cttccctgcc 
aggatnaggt 
tncacccaac 
cccaatttgg 



ggtttcggca 
cgnnganctc 
cgcttgncca 
ccattaagga 
ttttgaagat 
ncttccaagg 
g 



ntggcatccg 60 
gcggangcaa 120 
aggacatgaa 180 
atcaagagat 240 
tatgccagtg 300 
gcnatttttg 360 
401 



<210> 257 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220>" 

<221> misc_feature 
<222> 382, 387 
<223> n = A,T,C or G 



<400> 257 

atgtatgtaa 

ctctcagccc 

ttaaaatatc 

taacaggctc 

gcctgttcag 

acagctcttt 

gaaactccca 



aacacttcat 
tgaggtatac 
tgctaagtaa 
cccactttct 
agtgtctaca 
tggttgcttt 
gttaaagcct 



aaaatgtaaa 
agaatcattt 
tttgctatgt 
tttaatgtgc 
gtaagagctg 
ccacttttct 
angctancaa 



gggctataac 
gcctcagact 
cttctcccac 
tgttatgagc 
gacaaactct 
gaaaggttca 
ttttttttag 



aaatatgtta 
gctgttggat 
actatcaata 
tttggacatg 
ggagggacac 
cagtaacctt 
t 



taaagtgatt 60 
tttaaaattt 120 
tgcctgcttc 180 
agataaccgt 240 
agtctttgag 300 
ctagataata 360 
401 



<210> 258 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<400> 258 

ggagcgctag 

tgaggggccg 

tctgtggagg 

caattttcat 

gctactatga- 

ttcacaagtt 

attcagagag 



gtcggtgtac 
ggcccaagct 
agcagcagta 
ctttgcaatc 
tatcttaggt 
ggccatgaag 
attgcagaag 



gaccgagatt 
gccgacccga 
gtcggagggt 
tgcattttaa 
gtgccaaaat 
taccaccctg 
catatgaaac 



agggtgcgtg 
gccgatcgtc 
gcaggatatt 
tgataacaga 
cggcatcaga 
acaaaaataa 
actctcagat 



ccagctccgg 
agggtcgcca 
agaaatggct 
attaattctg 
gcgccaaatc 
gacccagatg 

g 



gaggccgcgg 60 
gcgcctcagc 12 0 
actccccagt 180 
gcctcaaaaa 240 
aagaaggcct 300 
ctgaagcaaa 360 
401 
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<210> 259 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<400> 259 

attgggtttg gagggaggat gatgacagag gaatgccctt tggccatcac ggttttgatt 60 
ctccagaata ttgtgggttt gatcatcaat gcagtcatgt taggctgcat tttcatgaaa 120 
acagctcagg ctcacagaag ggcagaaact ttgattttca gccgccatgc tgtgattgcc 180 
gtccgaaatg gcaagctgtg cttcatgttc cgagtgggtg acctgaggaa aagcatgatc 240 
attagtgcct ctgtgcgcat ccaggtggtc aagaaaacaa ctacacctga aggggaggtg 300 
gttcctattc accaactgga cattcctgtt gataacccaa tcgagagcaa taacattttt 3 60 
ctggtggccc ctttgatcat ctgccacgtg attgacaagc g 401 

<210> 260 
<211> 363 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<222> 7, 9, 19, 41/ 63, 73, 106, 111, 113, 116, 119, 156, 158, 
162, 187, 247, 288, 289, 290, 292, 298, 299, 300, 340 
<223> n = A,T,C or G 

<400> 260 

aggaganang gagggggana tgaataggga tggagaggga natagtggat gagcagggca 60 

canggagagg aancagaaag gagaggcaag acagggagac acacancaca nangangana 120 

caggtggggg ctggggtggg gcatggagag cctttnangt cncccaggcc accctgctct 180 

cgctggnctg ttgaaaccca ctccatggct tcctgccact gcagttgggc ccagggctgg 240 

cttattnctg gaatgcaagt ggctgtggct tggagcctcc cctctggnnn anggaaannn 300 

attgctccct tatctgcttg gaatatctga gtttttccan cccggaaata aaacacacac 360 
aca 363 

<210> 261 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 114, 152 
<223> n - A, T, C or G 



<400> 261 

cggctctccg ccgctctccc ggggtttcgg 
tcaccttccc ctgacctgag tagtcgccat 
cttccctgga tttgatgagc gggctgatgc 
gggcacagat gaggagagca tcctgactct 
ggaaatctct gcagctttta agactctgtt 
agaactaact ggaaaatttg aaaaattaat 
tgatgcttat gaactgaaac atgccttgaa 

<210> 262 
<211> 401 
<212> DNA 
<213> Homo sapiens 



ggcacttggg tcccacagtc tggtcctgct 60 
ggcacaggtt ctcagaggca ctgngactga 120 
anaaactctt cggaaggcta tgaaaggctt 180 
gttgacatcc cgaagtaatg ctcagcgcca 240 
tggcagggat cttctggatg acctgaaatc 300 
tgtggctctg atgaaaccct ctcggcttta 360 
gggagctgga a 401 
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<220> 

<221> mis cofeature 

<222> 7, 26, 258, 305, 358, 373, 374, 378 
<223> n = A,T,C or G 

<400> 262 

agtctanaac atttctaata ttttgngctt tcatatatca aaggagatta tgtgaaacta 60 
tttttaaata ctgtaaagtg acatatagtt ataagatata tttctgtaca gtagagaaag 120 
agtttataac atgaagaata ttgtaccatt atacattttc attctcgatc tcataagaaa 180 
ttcaaaagaa taatgataga ggtgaaaata tgtttacttt ctctaaatca agcctagttg 240 
tcaactcaaa aattatgntg catagtttta ttttgaattt aggttttggg actacttttt 300 
tccancttca atgagaaaat aaaatctaca actcaggagt tactacagaa gttctaanta 360 
tttttttgct aannagcnaa aaatataaac atatgaaaat g 401 

<210> 263 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 232, 290, 304, 326, 383 

<223> n m A,T,C or G 

<400> 263 

ctgtccgacc aagagaggcc ggccgagccc gaggcttggg cttttgcttt ctggcggagg 60 
gatctgcggc ggtttaggag gcggcgctga tcctgggagg aagaggcagc tacggcggcg 120 
gcggcggtgg cggctagggc ggcggcgaat aaaggggccg ccgccgggtg atgcggtgac 180 
cactgcggca ggcccaggag ctgagtgggc cccggccctc agcccgtccc gncggacccg 24 0 
ctttcctcaa ctctccatct tctcctgccg accgagatcg ccgaggcggn ctcaggctcc 300 
ctancccctt ccccgtccct tccccncccc cgtccccgcc ccgggggccg ccgccacccg 360 
cctcccacca tggctctgaa ganaatccac aaggaattga a 401 

<210> 264 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<400> 264 

aacaccagcc actccaggac ccctgaaggc 
aagccacccc ctggcagaaa cttcagctgt 
actttggcca gcattgacct tcaaagtcag 
cttcacattt tcatcccctc ctgcatcatt 
ctaagaaaac aactctgtca aaagctgtat 
accacaacaa agagggaagt gaacagtgct 
gggtgacctg atatgacatc taaagaagct 

<210> 265 
<211> 271 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 59 

<223> n = A,T,C or G 



ctctaccagg tcaccagtgt tctgcgccta 60 
gtgttctgga atactcacgt gagggaactt 120 
atggaaccca ggacccatcc aacttggctg 180 
gctttcattt tcatagccac agtgatagcc 240 
tcttcaaaag acacaacaaa aagacctgtc 300 
gtgaatctga acctgtggtc ttgggagcca 360 
tctggactct g 401 



<400> 265 
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gccacttcct gtggacatgg gcagagcgct gctgccagtt cctggtagcc ttgaccacna 60 

cgctgggggg tctttgtgat ggtcatgggt ctcatttgca cttgggggtg tgggattcaa 120 

gttagaagtt tctagatctg gccgggcgca gtggctcaca cctgtaatcc cagcacttta 180 

ggaggctgag gcaggcggat catgaggtca ggagatcgag accgtcctgg ctaacacagt 240 

gaaaccccgt ctctactaaa aatacaaaaa a 271 

<210> 266 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 45 

<223> n » A,T,C or G 
<400> 266 

attcataaat ttagctgaaa gatactgatt caatttgtat acagngaata taaatgagac 60 
gacagcaaaa ttttcatgaa atgtaaaata tttttatagt ttgttcatac tatatgaggt 120 
tctattttaa atgactttct ggattttaaa aaatttcttt aaatacaatc atttttgtaa 180 
tatttatttt atgcttatga tctagataat tgcagaatat cattttatct gactctgtct 240 
tcataagaga gctgtggccg aattttgaac atctgttata gggagtgatc aaattagaag 300 
gcaatgtgga aaaacaattc tgggaaagat ttctttatat gaagtccctg ccactagcca 360 
gccatcctaa ttgatgaaag ttatctgttc acaggcctgc a 401 

<210> 267 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 116, 247, 277, 296, 307, 313, 322, 323, 336, 342, 355, 365, 

377, 378, 397 

<223> n » A,T,C or G 

<400> 267 

gaagaggcat cacctgatcc cggagacctt tggagttaag aggcggcgga agcgagggcc 60 

tgtggagtcg gatcctcttc ggggtgagcc agggtcggcg cgcgcggctg tctcanaact 120 

catgcagctg ttcccgcgag gcctgtttga ggacgcgctg ccgcccatcg tgctgaggag 180 

ccaggtgtac agccttgtgc ctgacaggac cgtggccgac cggcagctga aggagcttca 240 

agagcanggg gagacaaaat cgtccagctg ggcttcnact tggatgccca tggaanttat 300 

tctttcnctt ganggactta cnngggaccc aagaanccct tncaaggggc ccttngtgga 360 

tgggncccga aaccccnnta tttgcccttg ggggggncca a 401 

<210> 268 
<211> 223 
<212> DNA 
<213> Homo sapiens 

<400> 268 

tcgccatgtt ggccaggctg gtcttgaact cctgacttta agtgatccac ccgcctcaac 60 
ctcccaaagt gctgggatta caggtgtgag ccaccgcgcc tggcctgata catactttta 120 
gaatcaagta gtcacgcact ttttctgttc atttttctaa aaagtaaata tacaaatgtt 180 
ttgttttttg ttttttttgt ttgtttgttt ctgttttttt ttt 223 



<210> 269 
<211> 401 
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<212>. DNA 

<213> Homo sapiens 
<400> 269 

actatgtaaa ccacattgta ctttttttta ctttggcaac aaatatttat acatacaaga 60 

tgctagttca tttgaatatt tctcccaact tatccaagga tctccagctc taacaaaatg 120 

gtttattttt atttaaatgt caatagttgt tttttaaaat ccaaatcaga ggtgcaggcc 180 

accagttaaa tgccgtctat caggttttgt gccttaagag actacagagt caaagctcat 240 

ttttaaagga gtaggacaaa gttgtcacag gtttttgttg ttgtttttat tgcccccaaa 300 

attacatgtt aatttccatt tatatcaggg attctattta cttgaagact gtgaagttgc 360 

cattttgtct cattgttttc tttgacataa ctaggatcca t 401 

<210> 270 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 240, 382 
<223> n = A,T,C or G 

<400> 270 

tggctgttga ttcacctcag cactgcttgg tatctgcacc ctacctctct ttagaggctg 60 

ccttgtcaac tgaaaaatgc acctgacttc gagcaagact ctttccttag gttctggatc 120 

tgtttgagcc ccatggcact gagctggaat ctgagggtct tgttccaagg atgtgatgat 180 

gtgggagaat gttctttgaa agagcagaaa tccagtctgc atggaaacag cctgtagagn 240 

agaagtttcc agtgataagt gttcactgtt ctaaggaggt acaccacagc tacctgaatt 300 

ttcccaaaat gagtgcttct gtgcgttaca actggccttt gtacttgact gtgatgactt 360 

tgttttttct tttcaattct anatgaacat gggaaaaaat g 401 

<210> 271 
<211> 329 
<212> DNA 

<213> Homo sapiens 
<400> 271 

ccacagcctc caagtcaggt ggggtggagt cccagagctg cacagggttt ggcccaagtt 60 
tctaagggag gcacttcctc ccctcgccca tcagtgccag cccctgctgg ctggtgcctg 120. 
agcccctcag acagccccct gccccgcagg cctgccttct cagggacttc tgcggggcct 180 
gaggcaagcc atggagtgag acccaggagc cggacacttc tcaggaaatg gcttttccca 240 
acccccagcc cccacccggt ggttcttcct gttctgtgac tgtgtatagt gccaccacag 300 
cttatggcat ctcattgagg acaaaaaaa 329 

<210> 272 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 1, 7, 12, 21, 61, 62, 66, 72, 78, 88, 90, 92, 98, 117, 119, 

128, 130, 134, 142, 144, 151, 159, 162, 164, 168, 169, 177, 

184, 185, 188, 194, 202, 204, 209, 213, 218, 223, 231, 260, 

272, 299, 300, 306, 321, .322, 323, 331, 335, 336, 338 
<223> n = A,T,C or G 

<221> misc feature 
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<222> 341, 342, 343, 345, 346, 351, 358, 360, 362, 363, 387, 390, 
392 

<223> n = A,T,C or G 
<400> 272 

nggctgntaa cntcggaggt nacttcctgg actatcctgg agaccccctc cgcttccacg 60 
nncatnatat cnctcatngc tgggcccntn angacacnat cccactccaa cacctgngng 120 
atgctggncn cctnggaacc ancntcagaa ngaccctgnt cntntgtnnt ccgcaanctg 180 
aagnnaangc gggntacacc tncntgcant ggnccacnct gcngggaact ntacacacct 240 
acgggatgtg gctgcgccan gagccaagag cntttctgga tgattcccca gcctcttgnn 300 
agggantcta caacattgct nnntaccttt ntccnncngc nnntnntgga ntacaggngn 360 
tnntaacact acatcttttt tactgcnccn tncttggtgg g 401 

<210>' 273 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 399 

<223> n - A,T,C or G 
<400> 273 

cagcaccatg aagatcaaga tcatcgcacc 
tggctccatc ctggcctcac tgtccacctt 
cgacgagtcg ggcccctcca tcgtccaccg 
gtagcatttg ctgcatgggt taattgagaa 
ctcatgctag cctcacgaaa ctggaataag 
tatctgatat cagcactgga ttgtagaact 
aactgttccc cttggtatta acgtgtcagg 



cccagagcgc aagtactcgg tgtggatcgg 60 
ccagcagatg tggattagca agcaggagta 120 
caaatgcttc taaacggact cagcagatgc 180 
tagaaatttg cccctggcaa atgcacacac 240 
ccttcgaaaa gaaattgtcc ttgaagcttg 300 
tgttgctgat tttgaccttg tattgaagtt 360 
gctgagtgnt c 401 



<210> 274 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> 274 

ccacccacac ccaccgcgcc ctcgttcgcc 
cgccgcccag gccatcgcca ccctccgcag 
cctaccgcag gatgttcggc ggcccgggca 
acgtgactac gtccacccgc acctacagcc 
gcagcctcta cgcctcgtcc ccgggcggcg 
tgcggagcag cgtgcccggg gtgcggctcc 
acgccatcaa caccgagttc aagaacaccc 

<210> 275 
<211> 401 
<212> DNA 
<213> Homo sapiens 



tcttctccgg gagccagtcc gcgccaccgc 60 
ccatgtccac caggtccgtg tcctcgtcct 120 
ccgcgagccg gccgagctcc agccggagct 180 
tgggcagcgc gctgcgcccc agcaccagcc 240 
tgtatgccac gcgctcctct gccgtgcgcc 300 
tgcaggactc ggtggacttc tcgctggccg 360 
gcaccaacga g 401 



<400> 275 

ccacttccac cactttgtgg agcagtgcct 
ctggcctggg cctgggcttc gggagagcag 
gaagggactt acctcccaaa ggttctgcag 
agctcctggg tgtgtcagag gccagcctgg 
agggagaggg agaggggacc cgaggctgag 
gacacggoag tgatgctgcg gtctctcctc 



tcagcgcaac ccggatgcca ggtatccctg 60 
agggtgctca ggagggtaag gccagggtgt 120 
gggaatctgg agctacacac aggagggatc 180 
ggagctctgg ccactgcttc ccatgagctg 240 
gcataagtgg caggatttcg ggaagctggg 300 
ccctttccct ccaggcccag tgccagcacc 360 
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ctcctgaacc actctttctt caagcagatc aagcgacgtg c 401 

<210> 276 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<222> 11 

<223> n - A, T,C or G 
<400> 276 

tctgatattg ntacccttga gccacctaag ttagaagaaa ttggaaatca agaagttgtc 60 
attgttgaag aagcacagag ttcagaagac tttaacatgg gctcttcctc tagcagccag 120 
tatactttct gtcagccaga aactgtattt tcatctcagc ctagtgatga tgaatcaagt 180 
agtgatgaaa ccagtaatca gcccagtcct gcctttagac gacgccgtgc taggaagaag 240 
accgtttctg cttcagaatc tgaagaccgg ctagttggtg aacaagaaac tgaaccttct 300 
aaggagttga gtaaacgtca gttcagtagt ggtctcaata agtgtgttat acttgctttg 360 
gtgattgcaa tcagcatggg atttggccat ttctatggca c 401 

<210> 277 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 227, 333 ' 
<223> n = A,T,C or G 

<400> 277 

aactttggca acatatctca gcaaaaacta cagctatgtt attcatgcca aaataaaagc 60 

tgtgcagagg agtggctgca atgaggtcac aacggtggtg gatgtaaaag agatcttcaa 120 

gtcctcatca cccatccctc gaactcaagt cccgctcatt acaaattctt cttgccagtg 180 

tccacacatc ctgccccatc aagatgttct catcatgtgt tacgagnggc gctcaaggat 240 

gatgcttctt gaaaattgct tagttgaaaa atggagagat cagcttagta aaagatccat 300 

acagtgggaa gagaggctgc aggaacagcg ganaacagtt caggacaaga agaaaacagc 360 

cgggcgcacc agtcgtagta atccccccaa accaaaggga a 401 

<210> 278 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 322, 354 
<223> n = A,T/C or G 

<400> 278 

aatgagtgtg agaccacaaa tgaatgccgg gaggatgaaa tgtgttggaa ttatcatggc 60 

ggcttccgtt gttatccacg aaatccttgt caagatccct acattctaac accagagaac 12 0 

cgatgtgttt gcccagtctc aaatgccatg tgccgagaac tgccccagtc aatagtctac 180 

aaatacatga gcatccgatc tgataggtct gtgccatcag acatcttcca gatacaggcc 240 

acaactattt atgccaacac catcaatact tttcggatta aatctggaaa tgaaaatgga 300 

gagtctacct acgacaacaa anccctgtaa gtgcaatgct tgtgctcgtg aagncattat 360 

caggaccaag agaacatatc gtggacctgg agatgctgac a 401 
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<210> 279 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 30, 35, 81, 88, 180, 212, 378, 384, 391 
<223> n = A,T,C or G 



<400> 279 

aaattattgc ctctgataca tacctaagtn 
cattacttgg agggttgcag nttctaantg 
taggaaacaa gcatgaacgg cagtctagaa 
gccattatcc tgtggaatct gatatgtctg 
tctttggaaa tgatgagatt atttcctgtg 
aatgtgaaac tgaaactaat aattttgatc 
gctctaaata acaaaagnta gggngacaag 

<210> 280 
<211> 326 
<212> DNA 
<213> Homo sapiens 



aacanaacat taatacctaa gtaaacataa 60 
aaactgtatt tgaaactttt aagtatactt 120 
taccagaaac atctacttgg gtagcttggn 180 
gnagcatgtc attgatggga catgaagaca 240 
ttaaaaaaaa aaaaaatctt aaattcctac 300 
ctgatgtatg ggacagcgta tctgtaccag 360 
nacatgttcc t 401 



60 



<400> 280 

gaagtggaat tgtataattc aattcgataa ttgatctcat gggctttccc tggaggaaag 

gttttttttg ttgttttttt tttaagaact tgaaacttgt aaactgagat gtctgtagct 120 

tttttgccca tctgtagtgt atgtgaagat ttcaaaacct gagagcactt tttctttgtt 180 

tagaattatg agaaaggcac tagatgactt taggatttgc atttttccct ttattgcctc 24 0 

atttcttgtg acgccttgtt ggggagggaa atctgtttat tttttcctac aaataaaaag 300 

ctaagattct atatcgcaaa aaaaaa 326 

<210> 281 
<211> 374 
<212> DNA 
<213> Homo sapiens 

<400> 281 

caacgcgttt gcaaatattc ccctggtagc ctacttcctt acccccgaat attggtaaga 60 
tcgagcaatg gcttcaggac atgggttctc ttctcctgtg atcattcaag tgctcactgc 120 
atgaagactg gcttgtctca gtgtttcaac ctcaccaggg ctgtctcttg gtccacacct 180 
cgctccctgt tagtgccgta tgacagcccc catcaaatga ccttggccaa gtcacggttt 240 
ctctgtggtc aaggttggtt ggctgattgg tggaaagtag ggtggaccaa aggaggccac 300 
gtgagcagtc agcaccagtt ctgcaccagc agcgcctccg tcctagtggg tgttcctgtt 360 
tctcctggcc ctgg 3 *?4 

<210> 282 

<211> 404 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 26, 27, 51, 137, 180, 222 

<223> n = A,T,C or G 



<400> 282 
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agtgtggtgg aattcccgca tcctanncgc 
aaaattccag tgtcagcatt cttgctcctt 
accacagtca aacctgnagc caaaaaggac 
accctctcca gaggttgggg tgaccaactc 
tataaatcca agacaagcaa caaacccttg 
cacagtcaag . ctttaaagaa agtgtttgct 
cagtttgtcc tcctcaatct ggtttatgaa 

<210> 283 
<211> 184 
<212> DNA 
<213> Homo sapiens 



cgactcacac aaggcagagt ngccatggag 60 
gtggccctct cctacactct ggccagagat 120 
acaaaggact ctcgacccaa actgccccan 180 
atctggactc anacatatga agaagctcta 240 
atgattattc atcacttgga tgagtgccca 300 
gaaaataaag aaatccagaa attggcagag 360 
acaactgaca aaca 404 



<220> 

<221> misc_feature 
<222> 26 

<223> n = A,T,C or G 



<400> 283 

agtgtggtgg aattcacttg cttaanttgt gggcaaaaga gaaaaagaag gattgatcag 60 

agcattgtgc aatacagttt cattaactcc ttccctcgct cccccaaaaa tttgaatttt 120 

tttttcaaca ctcttacacc tgttatggaa aatgtcaacc tttgtaagaa aaccaaaata 180 
aaaa 184 

<210> 284 
<211> 421 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 147, 149 
<223> n - A,T/C or G 



<400> 284 

ctattaatcc tgccacaata tttttaatta 
cccatttcac ccactgctct gtttggccgc 
gaggcggata ccctttcctc ggggaanana 
aaaaatgttg gaaagtcgag tggcaaagct 
gtcaaaagat ccagggtgcc tctctctgtt 
acctccagtc accatacaca ggttaccagt 
agtctctaaa tcaatctgaa tggtatcatt 

g 

<210> 285 
<211> 361 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^f eature 

<222> 34, 188 

<223> n = A,T,C or G 



cgtacaaaga tctgacatgt cacccaggga 60 
cagtcttttg tctctctctt cagcaatggt 120 
aatccatggt ttgttgccct tgccaataac 180 
gttgccattg gcatctttca cgtgaaccac 240 
ggtgatcaca ccaattcttc ctaggttagc 300 
gtcgaacttg atgaaatcag taatcttgcc 360 
caccttgatg aggggatcgg ggtagcggat 420 

421 



<400> 285 

ctgggtggta actctttatt tcattgtccg 
cactgtgcag gcttcagctt ccactccggg 
ctgccaggtg cacagccctg gctcccgagg 



gaanaaagat gggagtggga acagggtgga 60 
caggattcag gctatctggg accgcaggga 120 
caggcaggca aggtgacggg actggaagcc 180 
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cttttcanag ccttggagga gctggtccgt 
gcaggggatg gataaacagg gaaacactgt 
ggtgaagccc cggcgctgag ctaagctcag 
a 



ccacaagcaa tgagtgccac tctgcagttt 240 
gcattcctca cagccaacag tgtaggtctt 300 
gctgttccag ggagccacga aactgcaggt 360 

361 



<210> 286 
<211> 336 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 40, 68, 75, 127, 262 

<223> n - A,T,C or G 

<400> 286 

tttgagtggc agcgccttta tttgtggggg 
ggaggaanag ccganaaact gtgtgaccgg 
cttgcanatg cccattggca tcaccggtgc 
tcttgttcaa catagggtag gtggcagcca 
ggcgctccat tttgtgttcc angagcatgt 
tgaggatgtt ctcgatgcag ctgcgctggc 

<210> 287 

<211> 301 

<212> DNA 

<213> Homo sapiens 



ccttcaaggn agggtcgtgg ggggcagcgg 60 
ggcctcaggt ggtgggcatt gggggctcct 120 
agccattggt ggcagcgggt accggtcctt 180 
cgggtccaac tcgcttgagg ctgggccctg 240 
ggttctgtgg cgggagcccc acgcaggccc 300 
ggaaaa 336 



<220> 

<221> misc_feature 

<222> 15, 33, 44, 53, 76, 83, 107, 117, 154, 166, 192, 194, 207, 

215, 241, 246 

<223> n = A,T,C or G 

<400> 287 

tgggtaccaa atttntttat ttgaaggaat ggnacaaatc aaanaactta agnggatgtt 60 
ttggtacaac ttatanaaaa ggnaaaggaa accccaacat gcatgcnctg ccttggngac 120 
cagggaagtc accccacggc tatggggaaa ttancccgag gcttancttt cattatcact 18 0 
gtctcccagg gngngcttgt caaaaanata ttccnccaag ccaaattcgg gcgctcccat 240 
nttgcncaag ttggtcacgt ggtcacccaa ttctttgatg gctttcacct gctcattcag 300 
g " 301 

<210> 288 
<211> 358 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 39, 143, 226 
<223> n = A,T,C or G 

<400> 288 

aagtttttaa actttttatt tgcatattaa 
tttgaacaaa aaaaaaaatg gcactctgat 
gggccagctt ggttttdctc tanatttcac 
ttcttccttc accaacatgc aagttctttc 
gggaaaggca ggcgcggcct tcgttgtcag 



aaaaattgng cattccaata attaaaatca 60 
taaactgcat tacagcctgc aggacacctt 12 0 
tgtcgtccca ccccacttct tccaccccac 180 
cttccctgcc agccanatag atagacagat 240 
tagttctttg atgtgaaagg ggcagcacag 300 
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tcatttaaac ttgatccaac ctctttgcat cttacaaagt taaacagcta aaagaagt 358 

<210> 289 
<211> 462 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 87, 141, 182, 220, 269, 327 

<223> n = A, T,C or G 

<400> 289 

ggcatcagaa atgctgttta tttctctgct gctcccaagc tggctggcct ttgcagagga 60 
gcagacaaca gatgcatagt tgggganaaa gggaggacag gttccaggat agagggtgca 120 
ggctgaggga ggaagggtaa naggaaggaa ggccatcctg gatccccaca tttcagtctc 180 
anatgaggac aaagggactc ccaagccccc aaatcatcan aaaacaccaa ggagcaggag 240 
gagcttgagc aggccccagg gagcctcana gccataccag ccactgtcta cttcccatcc 300 
tcctctccca ttccctgtct gcttcanacc acctcccagc taagccccag ctccattccc 360 
ccaatcctgg cccttgccag cttgacagtc acagtgcctg gaattccacc actgaggctt 420 
ctcccagttg gattaggacg tcgccctgtt agcatgctgc cc 462 

<210> 290 
<211> 481 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 44, 57, 122, 158, 304, 325, 352, 405 
<223> n = A,T,C or G 

<400> 290 

tactttccta aactttatta aagaaaaaag caataagcaa tggnggtaaa tctctanaac 60 
atacccaatt ttctgggctt cctcccccga gaatgtgaca ttttgatttc caaacatgcc 120 
anaagtgtat ggttcccaac tgtactaaag taggtganaa gctgaagtcc tcaagtgttc 180 
atcttccaac ttttcccagt ctgtggtctg tctttggatc agcaataatt gcctgaacag 240 
ctactatggc ttcgttgatt tttgtctgta gctctctgag ctcctctatg tgcagcaatc 300 
gcanaatttg agcagcttca ttaanaactg catctcctgt gtcaaaacca anaatatgtt 360 
tgtctaaagc aacaggtaag ccctcttttg tttgatttgc cttancaact gcatcctgtg 420 
tcaggcgctc ctgaaccaaa atccgaattg ccttaagcat taccaggtaa tcatcatgac 480 
g " 481 

<210> 291 
<211> 381 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 79, 166, 187, 208, 219, 315 

<223> n = A, T, C or G 

<400> 291 

tcatagtaat gtaaaaccat ttgtttaatt ctaaatcaaa tcactttcac aacagtgaaa 60 
attagtgact ggttaaggng tgccactgta catatcatca ttttctgact ggggtcagga 120 
cctggtccta gtccacaagg gtggcaggag gagggtggag gctaanaaca cagaaaacac 180 
acaaaanaaa ggaaagctgc cttggcanaa ggatgaggng gtgagcttgc cgaaggatgg 240 
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tgggaagggg gctccctgtt ggggccgagc 
cttagctcct ggcanagggt gagtggggac 
ccagcctggc tttactaaca g 

<210> 292 
<211> 371 
<212> DNA 
<213> Homo 

<220> 
<221> misc_feature 
<222> 32, 55, 72, 151, 189, 292 
<223> n = A,T,C or G 



caggagtccc aagtcagctc tcctgcctta 300 
ctacgaggtt caaaatcaaa tggcatttgg 360 

381 



sapiens 



<400> 292 

gaaaaaataa tccgtttaat tgaaaaacct gnaggatact attccactcc cccanatgag 60 
gaggctgagg anaccaaacc cctacatcac ctcgtagcca cttctgatac tcttcacgag 120 
gcagcaggca aagacaattc ccaaaacctc nacaaaagca attccaaggg ctgctgcagc 180 
taccaccanc acatttttcc tcagccagcc cccaatcttc tccacacagc cctccttatg 240 
gatcgccttc tcgttgaaat taatcccaca gcccacagta acattaatgc ancaggagtc 300 
ggggactcgg ttcttcgaca tggaagggat tttctcccaa tctgtgtagt tagcagcccc 360 
acagcactta a 371 

<210> 293 
<211> 361 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 75, 196, 222 
<223> n - A,T,C or G 



<400> 293 

gatttaaaag aaaacacttt attgttcagc 
tccataattt attgngatgt tatcaacatc 
ctgttcatgt tttcttgaac acgtcttcaa 
tgaggtaacg aagcanaagt atttttaaac 
cctatatgct caatacatgc cgcgtgatcc 
tttggaaaac atctgttatg atgactttca 
c 

<210> 294 
<211> 391 
<212> DNA 
<213> Homo sapiens 



aattaaaagt tagccaaata tgtatttttc 60 
aagtaaaatg ctcattttca tcatttgctt 120 
ttttccttcc aaaatgctgc atgccacact 180 
atgacagcta anaacattca tctacagcaa 240 
tagtagtttt ttcacaacct tctacaagtt 300 
tacaccttca cctcaaaggc tttcttgcac 360 

361 



<220> 

<221> misc_feature 

<222> 26, 77, 96, 150, 203, 252, 254, 264, 276 
<223> n - A,T,C or G 



<400> 294 

tattttaaag tttaattatg attcanaaaa 
atattgactc tgtatanacc acagttattg 
tattttttat tctgaaaatg atattaatan 
tacatatgcc caaaatggct ganaataaat 
agggcatgca ananaaaatc tcanaatacc 



aatcgagcga ataactttct ctgaaaaaat 60 
gggganaagg gctggtaggt taaattatcc 120 
aaagtcccgt ttccagtctg attataaaga 180 
acaacaggaa atgcaaaagc tgtaaagcta 24 0 
caaagnggca acaaggaacg tttggctgga 300 
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atttgaagtt atttcagtca tctttgtctt tggctccatg tttcaggatg cgtgtgaact 360 
cgatgtaatt gaaattcccc tttttatcaa t 391 

<210> 295 
<211> 343 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc__feature 
<222> 145, 174, 205, 232 
<223> n = A,T,C or G 



<400> 295 

ttcttttgtt ttattgataa 
aaataataaa gaatgtgtct 
acaaatatag agttcttcac 
aattgtgctt ctacaaaacc 
cacatttcca ttattacact 
tctttcacaa aagccaagcc 



cagaaactgt gcataattac 
actgccagca aaatacaatt 
accanatggc tctggtgtaa 
ttcanagcat gaggtagttt 
tttagtgagc taaaatcctt 
tcatttacaa agggtttatt 



agatttgatg aggaatctgc 60 
attccatgcc ctctcaacat 120 
caaagccatt ttanatgttt 180 
cttttaccta cnatattttc 24 0 
ttaacatagc ctgcggatga 300 
tct 343 



<210> 296 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 96, 98, 106, 185 
<223> n = A, T,C or G 



<400> 296 

ttcttggata ttggttgttt ttgtgaaaaa 
tatttctcta ctttgccctc ctgatgccca 
cttccacttt ggaaaaaaaa aaaacctgtt 
gatanatcgc tctcaaaatc taaggctctg 
t 



gtttttgttt ttcttctcag tcaactgaat 60 
catgananaa cttaanataa tttctaacag 120 
ttcctcatgg aaccccagga gttgaaagtg 180 
ttcagcttta cattatgtta cctgacgttt 240 

241 



<210> 297 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 12, 130 

<223> n » A,T,C or G 



<400> 297 

gttgtggctg anaatgctgg agatgctcag 
cttggtggtg ccctcacatc tggggtcttc 
gtcaggacan accatgtccg tgctaggccc 
ctctcccagg tttctggtcc cgatgggcaa 
catcccacta cccctcacat gctctcactc 
ttcacgaact ctcaaagaaa aggaaggata 
tggaaaagta caaaaagaca gccagaggtg 



ttctctccct cacaaggtag gccacaaatt 60 
aggcaccagc catgcctgcc gaggagtgct 120 
aggcacagcc caaccactcc tcatccaagt 180 
ggatgacccc tccagtggct ggtaccccac 240 
tccatcaggt ccccaatcct ggcttccctc 300 
aaacctaaat aaaccagaca gaagcagctc 360 
t 391 



<210> 298 
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<211> 321 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 14, 30, 76, 116, 201, 288, 301 
<223> n = A,T,C or G 

<400> 298 

caagccaaac tgtntccagc tttattaaan atactttcca taaacaatca tggtatttca 60 

ggcaggacat gggcanacaa tcgttaacag tatacaacaa ctttcaaact cccttnttca 120 

atggactacc aaaaatcaaa aagccactat aaaacccaat gaagtcttca tctgatgctc 180 

tgaacaggga aagtttaaag ngagggttga catttcacat ttagcatgtt gtttaacaac 240 

ttttcacaag ccgaccctga ctttcaggaa gtgaaatgaa aatggcanaa tttatctgaa 300 

natccacaat ctaaaaatgg a 321 



<210> 299 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<222> 104, 268, 347 
<223> n = A, T,C or G 



<400> 299 

tatcataaag agtgttgaag tttatttatt 
attggtaaaa aaataaaaca aaaagcattt 
agaagtatca tttttctttg tcaaattata 
tggaattttg tcggtcactt gcactggttg 
agttaaattt tttttgttgg gatttcanat 
caacgtccac accaaattct tgatcaggac 
taggtagtct cacagccttg cgtgttcgat 



atagcaccat tgagacattt tgaaattgga 60 
gaattgtatt tggnggaaca gcaaaaaaag 120 
ctgtttccaa acattttgga aataaataac 180 
acaagattag aacaagagga acacatatgg 240 
agagtttggt ttataaaaag caaacagggc 300 
caccaatgtc atagggngca atatctacaa 360 
attcaaagac t 401 



<210> 300 
<211> 188 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 48 

<223> n « A,T,C or G 
<400> 300 

tgaatgcttt gtcatattaa gaaagttaaa gtgcaataat gtttgaanac aataagtggt 60 
ggtgtatctt gtttctaata agataaactt ttttgtcttt gctttatctt attagggagt 120 
tgtatgtcag tgtataaaac atactgtgtg gtataacagg cttaataaat tctttaaaag 180 
gaaaaaaa 

<210> 301 

<211> 291 

<212> DNA 

<213> Homo sapiens 



<400> 301 
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aagattttgt tttattttat tatggctaga 
acactaaaga aatcctctgt gcttttcaat 
tggtgtgact tcaagagttc atgttaactt 
tgtattcttg aagagcctgg gccatgaaga 
tgatgttctg gcagtaagtg tttatctggc 

<210> 302 
<211> 341 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 25 

<223> n = A,T,C or G 



aagacactgt tatagccaaa atcggcaatg 60 
atgcaaatat atttcttcca agagttgccc 120 
cttttctgga aacttccttt tcttagttgt 180 
gcttgcctaa gttttgggca gtgaactcct 240 
ctgcaatgag cagcgagtcc a 291 



<400> 302 

tgatttttca taattttatt aaatnatcac 
attacactac aatctgatag gagtggtaaa 
aaacgccacc ttttattgtc ctgtcttatt 
ttcatgagcc agcagtggac ttgagttaca 
gaagaagcca tcaaattctt gaggacttga 
cccccgggct gcaggaattc gatatcaagc 

<210> 303 
<211> 361 
<212> DNA 
<213> Homo sapiens 



tgggaaaact aatggttcgc gtatcacaca 60 
accagccaat ggaatccagg taaagtacaa 120 
tctcgggaag gagggttcta ctttacacat 180 
atgtgtaggt tccttgtggt tatagctgca 24 0 
catctctcgg aaagaagcaa actagtggat 300 
ttatcgatac c 341 



<220> 

<221> misc_feature 

<222> 15, 27, 92, 124, 127, 183, 198, 244, 320 
<223> n = A,T,C or G 



<400> 303 

tgcagacagt aaatnaattt tatttgngtt 
gctccgtgac agcccaccaa cccccaaccc 
caanaanatg gaaggatctc acggatctca 
tanacagacg gagttganat gctggaggat 
ccanacttca tcccagccgg gacgtcctcc 
actttgccgc agttccaggn gtcctgcttc 
a 



cacagaacat actaggcgat ctcgacagtc 60 
tntacctcgc agccacccta aaggcgactt 120 
ttcctaatgg tccgccgaag tctcacacag 180 
gcagtcacct cctaaactta cgacccacca 240 
cccacccgag tcctccccat ttcttctcct 300 
caccagtccc acaaagctca ataaatacca 360 

361 



<210> 304 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 23, 104, 192 
<223> n = A,T,C or G 



<400> 304 

ctctttacaa cagcctttat ttncggccct 
tagctccgcc cgccaggctc tgtgccgcct 
ctcaggggct tgaggccgta ctcccccagc 
aaggtcagcc anaacaggtc gtcctgcaca 



tgatcctgct cggatgctgg tggaggccct 60 
ccccgcaggc gcanattcat gaacacggtg 120 
gggagctggt cctccagggg cttcccctcg 180 
ccctccagcc cgctcacttg ctgcttcagg 240 
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tgggccacgg tctgcgtcag ccgcacctcg taggtgctgc tgcggccctt gttattcctc 300 

<210> 305 
<211> 331 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 3, 36, 60, 193, 223 

<223> n = A,T,C or G 

<400> 305 

ganaggctag taacatcagt tttattgggt tggggnggca accatagcct ggctgggggn 60 

ggggctggcc ctcacaggtt gttgagttcc agcagggtct ggtccaaggt ctggtgaatc 120 

tcgacgttct cctccttggc actggccaag gtctcttcta ggtcatcgat ggttttctcc 180 

aactttgcca canacctctc ggcaaactct gctcgggtct cancctcctt cagcttctcc 240 

tccaacagtt tgatctcctc ttcatattta tcttctttgg gggaatactc ctcctctgag 300 

gccatcaggg acttgagggc ctggtccatg g 331 

<210> 306 

<211> 457 

<212> DNA 

<213> Homo sapiens 

<400> 306 

aatatgtaaa ggtaataact tttattatat taaagacaat gcaaacgaaa aacagaattg 60 
• agcagtgcaa aatttaaagg actgttttgt tctcaaagtt gcaagtttca aagccaaaag 120 
aattatatgt atcaaatata taagtaaaaa aaagttagac tttcaagcct gtaatcccag 180 
cactttggga ggctgaggca ggtggatcac taacattaaa aagacaacat tagattttgt 240 
cgatttatag caattttata aatatataac tttgtcactt ggatcctgaa gcaaaataat 300 
aaagtgaatt tgggattttt gtacttggta aaaagtttaa caccctaaat tcacaactag 360 
tggatccccc gggctgcagg aattcgatat caagcttatc gataccgtcg acctcgaggg 420 
ggggcccggt acccaattcg ccctatagtg agtcgta 457 

<210> 307 
<211> 491 
<212> DNA 
<213> Homo sapiens 

<400> 307 • 

gtgcttggac ggaacccggc gctcgttccc caccccggcc ggccgcccat agccagccct 



60 
120 



ccgtcacctc ttcaccgcac cctcggactg ccccaaggcc cccgccgccg ctccagcgcc 
gcgcagccac cgccgccgcc gccgcctctc cttagtcgcc gccatgacga ccgcgtccac 180 
ctcgcaggtg cgccagaact accaccagga ctcagaggcc gccatcaacc gccagatcaa 240 
cctggagctc tacgcctcct acgtttacct gtccatgtct tactactttg accgcgatga 300 
tgtggctttg aagaactttg ccaaatactt tcttcaccaa tctcatgagg agagggaaca 360 
tgctgagaaa ctgatgaagc tgcagaacca acgaggtggc cgaatcttcc ttcaggatat 420 
caagaaacca gactgtgatg actgggagag cgggctgaat gcaatggagt- gtgcattaca 480 
tttggaaaaa a 491 

<210> 308 
<211> 421 
<212> DNA 
<213> Homo sapiens 



<400> 30B 



WO 02/47534 



132 



PCT/US01/47576 



ctcagcgctt cttctttctt ggtttgatcc 
aggccctgga tgtgatggtg tccaccttcc 
tcaagctcaa caagtcagaa ctaaaggagc 
ggaaaaggac agatgaagct gctttccaga 
acaacgaggt ggacttccaa gagtactgtg 
acgaattctt tgaaggcttc ccagataagc 
gtggttgggg ggtctgccag ctggggccct 
c 



tgactgctgt catggcgtgc cctctggaga 60 
acaagtactc gggcaaagag ggtgacaagt 120 
tgctgacccg ggagctgccc agcttcttgg 180 
agctgatgag caacttggac agcaacaggg 240 
tcttcctgtc ctgcatcgcc atgatgtgta 300 
agcccaggaa gaaatgaaaa ctcctctgat 360 
ccctgtcgcc agtgggcact tttttttttc 420 

421 



<210> 309 
<211> J 321 
<212> DNA 
<213> Homo sapiens 

<400> 309 

accaaatggc ggatgacgcc ggtgcagcgg gggggcccgg gggccctggt ggccctggga 60 

tggggaaccg cggtggcttc cgcggaggtt tcggcagtgg catccggggc cggggtcgcg 120 

gccgtggacg gggccggggc cgaggccgcg gagctcgcgg aggcaaggcc gaggataagg 180 

agtggatgcc cgtcaccaag ttgggccgct tggtcaagga catgaagatc aagtccctgg 240 

aggagatcta tctcttctcc ctgcccatta aggaatcaga gatcattgat ttcttcctgg 300 
gggcctctct caaggatgag g 321 

<210> 310 
<211> 381 
<212> DNA 
<213> Homo sapiens 

<400> 310 

ttaaccagcc atattggctc aataaatagc ttcggtaagg agttaatttc cttctagaaa 60 

tcagtgccta tttttcctgg aaactcaatt ttaaatagtc caattccatc tgaagccaag 120 

ctgttgtcat tttcattcgg tgacattctc tcccatgaca cccagaaggg gcagaagaac 180 

cacatttttc atttatagat gtttgcatcc tttgtattaa aattattttg aaggggttgc 240 

ctcattggat ggcttttttt tttttcctcc agggagaagg ggagaaatgt acttggaaat 300 

taatgtatgt ttacatctct ttgcaaattc ctgtacatag agatatattt tttaagtgtg 360 

aatgtaacaa catactgtga a 381 



<210> 311 
<211> 530 
<212>~DNA 
<213> Homo sapiens 



<400> 311 

tttgaattta caccaagaac ttctcaataa 
cataccacaa gagaagttaa tttcttaaca 
accaagttct gatatctttt aaagacatag 
tgaaaatatc cttgttgtgt attaggtttt 
gtcatcagta ccctcctatt cagctcccca 
ggttttcttc ttatttttag ataattcaag 
tttatggtaa actcttttaa agaaaattta 
ttaaatcttt atcatagact ctgtacatat 
atcatcggtg ggatgacaga acaaacatat 



aagaaaatca tgaatgctcc acaatttcaa 60 
ttgtgttcta tgattatttg taagaccttc 120 
ttcaaaattg cttttgaaaa tctgtattct 180 
taaataccag ctaaaggatt acctcactga 240 
agatgatgtg tttttgctta ccctaagaga 300 
tgcttagata aattatgttt tctttaagtg 360 
atatgttata gctgaatctt tttggtaact 420 
gttcaaatta gctgcttgcc tgatgtgtgt 480 
ttatgatcat gaataatgtg ctttgtaa 538 



<210> 312 
<211> 176 
<212> DNA 
<213> Homo sapiens 



<400> 312 
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ggaggagcag ctgagagata gggtcagtga atgcggttca gcctgctacc tctcctgtct 60 
tcatagaacc attgccttag aattattgta tgacacgttt tttgttggtt aagctgtaag 120 
gttttgttct ttgtgaacat gggtattttg aggggagggt ggagggagta gggaag 17 6 

<210> 313 
<211> 396 
<212> DNA 
<213> Homo sapiens 



60 



<400> 313 

ccagcacccc caggccctgg gggacctggg ttctcagact gccaaagaag ccttgccatc 
tggcgctccc atggctcttg caacatctcc ccttcgtttt tgagggggtc atgccggggg 120 
agccaccagc ccctcactgg gttcggagga gagtcaggaa gggccaagca cgacaaagca 180 
gaaacatcgg atttggggaa cgcgtgtcaa tcccttgtgc cgcagggctg ggcgggagag 240 
actgttctgt tccttgtgta actgtgttgc tgaaagacta cctcgttctt gtcttgatgt 300 
gtcaccgggg caactgcctg ggggcgggga tgggggcagg gtggaagcgg ctccccattt 360 
tataccaaag gtgctacatc tatgtgatgg gtgggg 396 

<210> 314 
<211> 311 
<212> DNA 
<213> Homo sapiens 

<400> 314 

cctcaacatc ctcagagagg actggaagcc agtccttacg ataaactcca taatttatgg 60 

cctgcagtat ctcttcttgg agcccaaccc cgaggaccca ctgaacaagg aggccgcaga 120 

ggtcctgcag aacaaccggc ggctgtttga gcagaacgtg cagcgctcca tgcggggtgg 180 

ctacatcggc tccacctact ttgagcgctg cctgaaatag ggttggcgca tacccacccc 240 

cgccacggcc acaagccctg gcatcccctg caaatattta ttgggggcca tgggtagggg 300 
tttggggggc g 311 

<210> 315 
<211> 336 
<212> DNA 
<213> Homo sapiens 

<400> 315 

tttagaacat ggttatcatc caagactact ctaccctgca acattgaact cccaagagca 60 

aatccacatt cctcttgagt tctgcagctt ctgtgtaaat agggcagctg tcgtctatgc 120 

cgtagaatca catgatctga ggaccattca tggaagctgc taaatagcct agtctgggga 180 

gtcttccata aagttttgca tggagcaaac aaacaggatt aaactaggtt tggttccttc 240 

agccctctaa aagcataggg cttagcctgc aggcttcctt gggctttctc tgtgtgtgta 300 

gttttgtaaa cactatagca tctgttaaga tccagt 336 

<210> 316 

<211> 436 

<212> DNA 

<213> Homo sapiens 

<400> 316 

aacatggtct gcgtgcctta agagagacgc ttcctgcaga acaggacctg actacaaaga 60 

atgtttccat tggaattgtt ggtaaagact tggagtttac aatctatgat gatgatgatg 120 

tgtctccatt cctggaaggt cttgaagaaa gaccacagag aaaggcacag cctgctcaac 180 

ctgctgatga acctgcagaa aaggctgatg aaccaatgga acattaagtg ataagccagt 240 

ctatatatgt attatcaaat atgtaagaat acaggcacca catactgatg acaataatct 300 

atactttgaa ccaaaagttg cagagtggtg gaatgctatg ttttaggaat cagtccagat 360 

gtgagttttt tccaagcaac ctcactgaaa cctatataat ggaatacatt tttctttgaa 420 

agggtctgta taatca 436 
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<210> 317 
<211> 196 
<212> DNA 

<213> Homo sapiens 
<400> 317 

tattccttgt gaagatgata tactattttt gttaagcgtg tctgtattta tgtgtgagga 60 
gctgctggct tgcagtgcgc gtgcacgtgg agagctggtg cccggagatt ggacggcctg 120 
atgctccctc ccctgccctg gtccagggaa gctggccgag ggtcctggct cctgaggggc 180 
atctgcccct ccccca 196 

<210> 318 
<211> 381 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 8, 9, 102, 122, 161, 182, 193, 235, 253, 265, 266, 290, 321, 
378 

<223> n = A,T,C or G 
<400> 318 

gacgcttnng ccgtaacgat gatcggagac atcctgctgt tcgggacgtt gctgatgaat 60 
gccggggcgg tgctgaactt taagctgaaa aagaaggaca cncagggctt tggggaggag 120 
tncagggagc ccaacacagg tgacaacatc cgggaattct tgctgancct cagatacttt 180 
cnaatcttca tcnccctgtg gaacatcttc atgatgttct gcatgattgt gctgntcggc 240 
tcttgaatcc cancgatgaa accannaact cactttcccg ggatgccgan tctccattcc 300 
tccattcctg atgacttcaa naatgttttt gaccaaaaaa ccgacaacct tcccagaaag 360' 
tccaagctcg tggtgggngg a 381 

<210> 319 
<211> 506 
<212> DNA 
<213> Homo sapiens 



<400> 319 

ctaagcttta cgaatggggt gacaacttat 
tttgtaaata cctttgttat aattgatagg 
cctctgagca gtgtatgtca ggacttgttc 
ttatacaggt agagatgtat gcagatgtgt 
ccattgatgt atgcatctct tggctgtact 
ctttgctaat attttaatgg tatagatctg 
tctgttgctg tgtgtttcat tttaaattga 
actctgccaa tgcttttatc tagaggcgtg 
tcccaagaaa ggcaggatta catctt 



gataaaaact agagctagtg aattagccta 60 
atacatcttg gacatggaat tgttaagcca 120 
attaggttgg cagcagaggg gcagaaggaa 180 
ccatatatgt ccatatttac attttgatag 240 
ataagaacac attaattcaa tggaaataca 300 
ctaatgaatt ctcttaaaaa catactgtat 360 
gcattaaggg aatgcagcat ttaaatcaga 420 
ttgccatttt tgtcttatat gaaatttctg 480 

506 



<210> 320 
<211> 351 
<212> DNA 

<213> Homo sapiens 
<400> 320 

ctgacctgca ggacgaaacc atgaagagcc 
cggtagtaac tttgtgttat gaatcacatg 
tcattaacag gagaaatgca aataccttca 
tccaagagag gatccgagaa cgctctaagc 



tgatccttct tgccatcctg gccgccttag 60 
aaagcatgga atcttatgaa cttaatccct 120 
tatcccctca gcagagatgg agagctaaag 180 
ctgtccacga gctcaatagg gaagcctgtg 240 
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atgactacag actttgcgaa cgctacgcca tggtttatgg atacaatgct gcctataatc 300 
gctacttcag gaagcgccga gggaccaaat gagactgagg gaagaaaaaa a 351 

<210> 321 
<211> 421 
<212> DNA 

<213> Homo sapiens 
<400> 321 

ctcggaggcg ttcagctgct tcaagatgaa gctgaacatc tccttcccag ccactggctg 60 
ccagaaactc attgaagtgg acgatgaacg caaacttcgt actttctatg agaagcgtat 120 
ggccacagaa gttgctgctg acgctctggg tgaagaatgg aagggttatg tggtccgaat 180 
cagtggtggg aacgacaaac aaggtttccc catgaagcag ggtgtcttga cccatggccg 240 
tgtccgcctg ctactgagta aggggcattc ctgttacaga ccaaggagaa ctggagaaag 300 
aaagagaaaa tcagttcgtg gttgcattgt ggatgcaaat ctgagcgttc tcaacttggt 360 
tattgtaaaa aaaggagaga aggatattcc tggactgact gatactacag tgcctcgccg 420 
c 421 

<210> 322 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<400> 322 

agcagctctc ctgccacagc tcctcacccc ctgaaaatgt tcgcctgctc caagtttgtc 60 
tccactccct ccttggtcaa gagcacctca cagctgctga gccgtccgct atctgcagtg 120 
gtgctgaaac gaccggagat actgacagat gagagcctca gcagcttggc agtctcatgt 180 
ccccttacct cacttgtctc tagccgcagc ttccaaacca gcgccatttc aagggacatc 240 
gacacagcag ccaagttcat tggagctggg gctgccacag ttggggtggc tggttctggg 300 
gctgggattg gaactgtgtt tgggagcctc atcattggtt atgccaggaa cccttctctg 360 
aagcaacagc tcttctccta cgccattctg ggctttgccc tctcggaggc catggggctc 420 
ttttgtctga tggtagcctt tctcatcctc tttgccatgt gaaggagccg tctccacctc 480 
ccatagttct cccgcgtctg gttggccccg tgtgttcctt t 521 

<210> 323 
<211> 435 
<212> DNA 
<213> Homo sapiens 

<400> 323* 

ccgaggtcgc acgcgtgaga cttctccgcc gcagacgccg ccgcgatgcg ctacgtcgcc 60 

tcctacctgc tggctgccct agggggcaac tcctccccca gcgccaagga catcaagaag 120 

atcttggaca gcgtgggtat cgaggcggac gacgaccggc tcaacaaggt tatcagtgag 180 

ctgaatggaa aaaacattga agacgtcatt gcccagggta ttggcaagct tgccagtgta 240 

cctgctggtg gggctgtagc cgtctctgct gccccaggct ctgcagcccc tgctgctggt 300 

tctgcccctg ctgcagcaga ggagaagaaa gatgagaaga aggaggagtc tgaagagtca 360 

gatgatgaca tgggatttgg cctttttgat taaattcctg ctcccctgca aataaagcct 420 
ttttacacat ctcaa 435 

<210> 324 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<400> 324 

aggagatcga ctttcggtgc ccgcaagacc agggctggaa cgccgagatc acgctgcaga 60 

tggtgcagta caagaatcgt caggccatcc tggcggtcaa atccacgcgg cagaagcagc 120 

agcacctggt ccagcagcag cccccctcgc agccgcagcc gcagccgcag ctccagcccc 180 
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aaccccagcc tcagcctcag ccgcaacccc 
aacccaagcc tcagccccag cagctccacc 
ctcatcctca ctcgcaccca caccctcacc 
cacacccaca gccgcactcg cagccgcacg 
ctgcctgaaa ggggcagctc ccgggcaaga 
gagcacattt ctattgtctt cacttggatc 



agccccaatc acaaccccag cctcagcccc 240 
cgtatccgca tccacatcca catccacact 300 
cgcacccgca tccgcaccaa ataccgcacc 360 
ggcaccggct tctccgcagc acctccaact 420 
caaggttttg aggacttgag gaagtgggac 480 
aaaagcaaaa c 521 



<210> 325 
<2U> 451 
<212> DNA 
<213> Homo sapiens 

<400> 325 

attttcattt ccattaacct ggaagctttc 
acattattta aacagaaaaa gatgggctct 
tatttttact tagattactt tgggaatgag 
agtgaatgtg tctgtagttg tgttagtttg 
atccaaataa gaggcatata cattgaattg 
acccccaccc ccacccaaga cattttaata 
acatgaggta gtgttccact ggcaggatga 
ctttatcact tgaattatta acttaatttg 

<210> 326 
<211> 421 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 296 

<223> n » A,T,C or G 



atgaatattc tcttctttta aaacatttta 60 
ttctggttag ttgttacatg atagcagaga 120 
agattgttgt cttgaactct ggcactgtac 180 
cattaagcat gtataacatt caagtatgtc 24 0 
tttttaatcc tctgacaagt tgactcttcg 300 
gtaaatagag agagagagaa gagttaatga 360 
cttttcaata gctcaaatca atttcagtgc 420 
a 451 



<400> 326 

cgcggtcgta agggctgagg atttttggtc 
tcgctctcgc cgaggaacaa gtcggtcagg 
ggataccgga aaaacacccg tggagccgga 
aacaagccgc aacgtaaaat ccttggaaaa 
agaaaagaat ctcaaagtga aaggaccagt 
tacaagaaaa actccttgtg gtgaaggttc 
tcacaagcga ctcattgact tgcacagtcc 
c 



cgcacgctcc tgctcctgac tcaccgctgt 60 
aagcccgcgc gcaacagcca tggcttttaa 120 
ggtggcaatt caccgaattc gaatcaccct 180 
ggtgtgtgct gacttgataa gaggcgcaaa 240 
tcgaatgcct accaagactt tgagantcac 300 
taagacgtgg gatcgtttcc agatgagaat 360 
ttctgagatt gttaagcaga ttacttccat 420 

421 



<210> 327 
<211> 456 
<212> DNA 

<213> Homo sapiens 
<400> 327 

atcttgacga ggctgcggtg tctgctgcta 
cgacaagaag aagaaggacg ctggaaagtc 
atccgggggc aaggccaaaa agaagaagtg 
taacttagtc ttgtttgaca aagctaccta 
taaacttata accccagctg tggtctctga 
ggcagccctt caggagctcc ttagtaaagg 
tcaagtaatt tacaccagaa ataccaaggg 
atgaataggt ccaaccagct gtacatttgg 



ttctccgagc ttcgcaatgc cgcctaagga 60 
ggccaagaaa gacaaagacc cagtgaacaa 120 
gtccaaaggc aaagttcggg acaagctcaa 180 
tgataaactc tgtaaggaag ttcccaacta 240 
gagactgaag attcgaggct ccctggccag 300 
acttatcaaa ctggtttcaa agcacagagc 360 
tggagatgct ccagctgctg gtgaagatgc 420 
aaaaat 456 



<210> 328 
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<211> 471 
<212> DNA 

<213> Homo sapiens 
<400> 328 

gtggaagtga catcgtcttt aaaccctgcg tggcaatccc tgacgcaccg ccgtgatgcc 60 
cagggaagac agggcgacct ggaagtccaa ctacttcctt aagatcatcc aactattgga 120 
tgattatccg aaatgtttca ttgtgggagc agacaatgtg ggctccaagc agatgcagca 180 
gatccgcatg tcccttcgcg ggaaggctgt ggtgctgatg ggcaagaaca ccatgatgcg 240 
caaggccatc cgagggcacc tggaaaacaa cccagctctg gagaaactgc tgcctcatat 300 
ccgggggaat gtgggctttg tgttcaccaa ggaggacctc actgagatca gggacatgtt 360 
gctggccaat aaggtgccag ctgctgcccg tgctggtgcc attgccccat gtgaagtcac 420 
tgtgccagcc cagaacactg gtctcgggcc cgagaagacc tcctttttcc a 471 

<210> 329 
<211> 278 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 154, 204 
<223> n = A,T,C or G 

<400> 329 

gtttaaactt aagcttggta ccgagctcgg atccactagt ccagtgtggt ggaattctag 60 

aaattgagat gcccccccag gccagcaaat gttccttttt gttcaaagtc tatttttatt 120 

ccttgatatt tttctttttt tttttttttt ttgnggatgg ggacttgtga atttttctaa 180 

aggtgctatt taacatggga gganagcgtg tgcggctcca gcccagcccg ctgctcactt 240 

tccaccctct ctccacctgc ctctggcttc tcaggcct 278 

<210> 330 

<211> 338 

<212> DNA 

<213> Homo sapiens 

<400> 330 

ctcaggcttc aacatcgaat acgccgcagg ccccttcgcc ctattcttca tagccgaata 60 

cacaaacatt attataataa acaccctcac cactacaatc ttcctaggaa caacatatga 120 

cgcactctcc cctgaactct acacaacata ttttgtcacc aagaccctac ttctaacctc 180 

cctgttctta tgaattcgaa cagcataccc ccgattccgc tacgaccaac tcatacacct 240 

cctatgaaaa aacttcctac cactcaccct agcattactt atatgatatg tctccatacc 300 

cattacaatc tccagcattc cccctcaaac ctaaaaaa • 338 



<210> 331 

<211> 2820 

<212> DNA 

<213> Homo sapiens 

<400> 331 

tggcaaaatc ctggagccag aagaaaggac 
gttgtacctg gaaaacaatg cccagactca 
gctcctgaac agcatggacc agcagattcg 
cacagaccac gcgcagaaca gcgtcacggc 
cttcgatgct ctctctccat cacccgccat 
cagttccgac gtgtccttcc agcagtcgag 
cactgaactg aagaaactct actgccaaat 
gatgacccca cctcctcagg gagctgttat 



agcagcattg atcaatctta cagctaacat 60 
atttagtgag ccacagtaca cgaacctggg 120 
gaacggctcc tcgtccacca gtccctataa 180 
gccctcgccc tacgcacagc ccagccccac 240 
cccctccaac accgactacc caggcccgca 300 
caccgccaag tcggccacct ggacgtattc 360 
tgcaaagaca tgccccatcc agatcaaggt 420 
ccgcgccatg cctgtctaca aaaaagctga 480 
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gcacgtcacg gaggtggtga agcggtgccc 
gggacagatt gcccctccta gtcatttgat 
tgtagaagat cccatcacag gaagacagag 
tggcactgaa ttcacgacag tcttgtacaa 
gatgaaccgc cgtccaattt taatcattgt 
gggccgacgc tgctttgagg cccggatctg 
tgaagatagc atcagaaagc agcaagtttc 
gcgcccgttt cgtcagaaca cacatggtat 
cccagatgat gaactgttat acttaccagt 
gaagatcaaa gagtccctgg aactcatgca 
caggcaacag caacagcagc agcaccagca 
tccatcttca tatggtaaca gctccccacc 
gccttctgtg agccagctta tcaaccctca 
tcctgatggc atgggagcca acattcccat 
catgaatgga ctcagcccca cccaggcact 
ccactgcaca cccccacctc cgtatcccac 
gttgggctgt tcatcatgtc tggactattt 
gattgagcat tactccatgg atgatctggc 
tgcgatctgg aagggcatcc tggaccaccg 
tctcctgcgg accccaagca gtgcctctac 
tgagcgtgtt attgatgctg tgcgattcac 
agatgagtgg aatgacttca actttgacat 
caaagaggag ggggagtgag cctcaccatg 
agccccctaa aagcactcct gcttaatctt 
tcttgtctga tttcttaggg gaaggagaag 
ctggcatcta attctgattc tggctttaag 
ctgccatggc taggtagaag tgagcaaaaa 
tttctcattg acttttataa agcatgttca 
gtataaatat acagtataga tttttgggtg 
ttaaatgaaa gaaaattgag ttgcacttat 
ggcttgtcta tactccttcc cttaaggggt 
aatgctacat gtgagtgcga tgatgtacag 
tgccacatca aacctttgag tagatccatt 
gatatgtatt cttttctcag tgttggtata 
gatggttcac gttggggtga tttaatccag 
ctctttagtt tttggttggg aatgaggaaa 
aaaacacccg acgtcatgta tttgagcata 
taccttatct tacaatgttg attgggaaaa 
ctaaatttca ctactagatt gactaactca 



caaccatgag ctgagccgtg agttcaacga 540 
tcgagtagag gggaacagcc atgcccagta 600 
tgtgctggta ccttatgagc caccccaggt 660 
tttcatgtgt aacagcagtt gtgttggagg 720 
tactctggaa accagagatg ggcaagtcct 780 
tgcttgccca ggaagagaca ggaaggcgga 840 
ggacagtaca aagaacggtg atggtacgaa 900 
ccagatgaca tccatcaaga aacgaagatc 960 
gaggggccgt gagacttatg aaatgctgtt 1020 
gtaccttcct cagcacacaa ttgaaacgta 1080 
cttacttcag aaacagacct caatacagtc 1140 
tctgaacaaa atgaacagca tgaacaagct 1200 
gcagcgcaac gccctcactc ctacaaccat 1260 
gatgggcacc cacatgccaa tggctggaga 1320 
ccctccccca ctctccatgc catccacctc 1380 
agattgcagc attgtcagtt tcttagcgag 1440 
cacgacccag gggctgacca ccatctatca 1500 
aagtctgaaa atccctgagc aatttcgaca 1560 
gcagctccac gaattctcct ccccttctca 1620 
agtcagtgtg ggctccagtg agacccgggg 1680 
cctccgccag accatctctt tcccaccccg 1740 
ggatgctcgc cgcaataagc aacagcgcat 1800 
tgagctcttc ctatccctct cctaactgcc 1860 
caaagccttc tccctagctc ctccccttcc 1920 
taagaggcta cctcttacct aacatctgac 1980 
ccttcaaaac tatagcttgc agaactgtag 2040 
agagttgggt gtctccttaa gctgcagaga 2100 
cccttatagt ctaagactat atatataaat 2160 
gggggcattg agtattgttt aaaatgtaat 2220 
tgaccatttt ttaatttact tgttttggat 2280 
atcatgtatg gtgataggta tctagagctt 2340 
attctttcag ttctttggat tctaaataca 2400 
tccattgctt attatgtagg taagactgta 24 60 
ttttatatta ctgacatttc ttctagtgat 2520 
ttataagaag aagttcatgt ccaaacggtc 2580 
attcttaaaa ggcccatagc agccagttca 2640 
tcagtaaccc ccttaaattt aatacccaga 2700 
catttgctgc ccattacaga ggtattaaaa 27 60 
aatacacatt tgctactgtt gtaagaattc 2820 



<210> 332 

<211> 2270 

<212> DNA 

<213> Homo sapiens 

<400> 332 

tcgttgatat caaagacagt tgaaggaaat 
acagtactgc cctgaccctt acatccagcg 
aaagaaagtt attaccgatc caccatgtcc 
ccagaggttt tccagcatat ctgggatttt 
attgacttga actttgtgga tgaaccatca 
agcatggact gtatccgcat gcaggactcg 
acgaacctgg ggctcctgaa cagcatggac 
agtccctata acacagacca cgcgcagaac 
cccagctcca ccttcgatgc tctctctcca 
ccaggcccgc acagtttcga cgtgtccttc 
tggacgtatt ccactgaact gaagaaactc 



gaattttgaa acttcacggt gtgccaccct 60 
tttcgtagaa acccagctca tttctcttgg 12 0 
cagagcacac agacaaatga attcctcagt 180 
ctggaacagc ctatatgttc agttcagccc 240 
gaagatggtg cgacaaacaa gattgagatt 300 
gacctgagtg accccatgtg gccacagtac 360 
cagcagattc agaacggctc ctcgtccacc 420 
agcgtcacgg cgccctcgcc ctacgcacag 4 80 
tcacccgcca tcccctccaa caccgactac 540 
cagcagtcga gcaccgccaa gtcggccacc 600 
tactgccaaa ttgcaaagac atgccccatc 660 
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cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat gcctgtctac 720 
aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 780 
gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc. 840 
catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 
ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 
tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 
gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgcttgccc aggaagagac 1080 
aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 1140 
gatggtacga agcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 
aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 1260 
gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 
attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacagacc 1380 
tcaatacagt ctccatcttc atatggtaac agctccccac ctctgaacaa aatgaacagc 1440 
atgaacaagc tgccttctgt gagccagctt atcaaccctc agcagcgcaa cgccctcact 1500 
cctacaacca ttcctgatgg catgggagcc aacattccca tgatgggcac ccacatgcca 1560 
atggctggag acatgaatgg actcagcccc acccaggcac tccctccccc actctccatg 1620 
ccatccacct cccactgcac acccccacct ccgtatccaa cagattgcag cattgtcggt 1680 
ttcttagcga ggttgggctg ttcatcatgt ctggactatt tcacgaccca ggggctgacc 1740 
accatctatc agattgagca ttactccatg gatgatctgg caagtctgaa aatccctgag 1800 
caatttcgac atgcgatctg gaagggcatc ctggaccacc ggcagctcca cgaattctcc 1860 
tccccttctc atctcctgcg gaccccaagc agtgcctcta cagtcagtgt gggctccagt 1920 
gagacccggg gtgagcgtgt tattgatgct gtgcgattca ccctccgcca gaccatctct 1980 
ttcccacccc gagatgagtg gaatgacttc aactttgaca- tggatgctcg ccgcaataag 2040 
caacagcgca tcaaagagga gggggagtga gcctcaccat gtgagctctt cctatccctc 2100 
tcctaactgc cagcccccta aaagcactcc tgcttaatct tcaaagcctt ctccctagct 2160 
cctccccttc ctcttgtctg atttcttagg ggaaggagaa gtaagaggct acctcttacc 2220 
taacatctga cctggcatct aattctgatt ctggctttaa gccttcaaaa 2270 

<210> 333 

<211> 2816 

<212> DNA 

<213> Homo sapiens 

<400> 333 

tcgttgatat caaagacagt tgaaggaaat gaattttgaa acttcacggt gtgccaccct 60 
acagtactgc cctgaccctt acatccagcg tttcgtagaa acccagctca tttctcttgg 120 
aaagaaagtt attaccgatc caccatgtcc cagagcacac agacaaatga attcctcagt 180 
ccagaggttt tccagcatat ctgggatttt ctggaacagc ctatatgttc agttcagccc 240 
attgacttga actttgtgga tgaaccatca gaagatggtg cgacaaacaa gattgagatt 300 
agcatggact gtatccgcat gcaggactcg gacctgagtg accccatgtg gccacagtac 360 
acgaacctgg ggctcctgaa cagcatggac cagcagattc agaacggctc ctcgtccacc 420 
agtccctata acacagacca cgcgcagaac agcgtcacgg cgccctcgcc ctacgcacag 4 80 
cccagctcca ccttcgatgc tctctctcca tcacccgcca tcccctccaa caccgactac 540 
ccaggcccgc acagtttcga cgtgtccttc cagcagtcga gcaccgccaa gtcggccacc 600 
tggacgtatt ccactgaact gaagaaactc tactgccaaa ttgcaaagac atgccccatc 660 
cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat gcctgtctac 720 
aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 780 
gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc 840 
catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 
ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 
tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 
gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgcttgccc aggaagagac 1080 
aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 1140 
gatggtacga agcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 
aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 1260 
gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 
attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacatctc 1380 
ctttcagcct gcttcaggaa tgagcttgtg gagccccgga gagaaactcc aaaacaatct 14 40 
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gacgtcttct ttagacattc caagccccca aaccgatcag tgtacccata gagccctatc 1500 
tctatatttt aagtgtgtgt gttgtatttc catgtgtata tgtgagtgtg tgtgtgtgta 1560 
tgtgtgtgcg tgtgtatcta gccctcataa acaggacttg aagacacttt ggctcagaga 1620 
cccaactgct caaaggcaca aagccactag tgagagaatc ttttgaaggg actcaaacct 1680 
ttacaagaaa ggatgttttc tgcagatttt gtatccttag accggccatt ggtgggtgag 1740 
gaaccactgt gtttgtctgt gagctttctg ttgtttcctg ggagggaggg gtcaggtggg 1800 
gaaaggggca ttaagatgtt tattggaacc cttttctgtc ttcttctgtt gtttttctaa 1860 
aattcacagg gaagcttttg agcaggtctc aaacttaaga tgtcttttta agaaaaggag 1920 
aaaaaagttg ttattgtctg tgcataagta agttgtaggt gactgagaga ctcagtcaga 1980 
cccttttaat gctggtcatg taataatatt gcaagtagta agaaacgaag gtgtcaagtg 2040 
tactgctggg cagcgaggtg atcattacca aaagtaatca actttgtggg tggagagttc 2100 
tttgtgagaa cttgcattat ttgtgtcctc ccctcatgtg taggtagaac atttcttaat 2160 
gctgtgtacc tgcctctgcc actgtatgtt ggcatctgtt atgctaaagt ttttcttgta 2220 
catgaaaccc tggaagacct actacaaaaa aactgttgtt tggcccccat agcaggtgaa 2280 
ctcattttgt gcttttaata gaaagacaaa tccaccccag taatattgcc cttacgtagt 2340 
tgtttaccat tattcaaagc tcaaaataga atttgaagcc ctctcacaaa atctgtgatt 2400 
aatttgctta attagagctt ctatccctca agcctaccta ccataaaacc agccatatta 24 60 
ctgatactgt tcagtgcatt tagccaggag acttacgttt tgagtaagtg agatccaagc 2520 
agacgtgtta aaatcagcac tcctggactg gaaattaaag attgaaaggg tagactactt 2580 
ttcttttttt tactcaaaag tttagagaat ctctgtttct ttccatttta aaaacatatt 2640 
ttaagataat agcataaaga ctttaaaaat gttcctcccc tccatcttcc cacacccagt 2700 
caccagcact gtattttctg tcaccaagac aatgatttct tgttattgag gctgttgctt 2760 
ttgtggatgt gtgattttaa ttttcaataa acttttgcat cttggtttaa aagaaa 2816 

<210> 334 
<211> 2082 
<212> DNA 

<213> Homo sapiens 
<400> 334 

agatgctaca gcgactgcac acccaggctg tatgatacag cctattgctc ccgggctgca 60 
aacctgtcca gcatgtgatg tggtgggata ctgaattgaa taccgaatac tgtaggcaat 120 
tgtaacacag tggtaagtct ttgtgtatct aaacatagct aaacaccaaa aggtatagta 180 
agaatatggt attataatct tatggaacta tcattgtata tgtggtttgt caaccagaat 240 
gtagttatac agcacaggac tgtgcttatg atgtgccaag cacagctctc agtactaact 300 
cctttaatct tcatatcaac cctaggaggt aacttcttaa gtagattcat attgtaaggg 360 
tctcggggtg ggggggttgg caaaatcctg gagccagaag aaaggacagc agcattgatc 420 
aatcttacag ctaacatgtt gtacctggaa aacaatgccc agactcaatt tagtgagcca 480 
cagtacacga acctggggct cctgaacagc atggaccagc agattcagaa cggctcctcg 540 
tccaccagtc cctataacac agaccacgcg cagaacagcg tcacggcgcc ctcgccctac 600 
gcacagccca gctccacctt cgatgctctc tctccatcac ccgccatccc ctccaacacc 660 
gactacccag gcccgcacag tttcgacgtg tccttccagc agtcgagcac cgccaagtcg 720 
gccacctgga cgtattccac tgaactgaag aaactctact gccaaattgc aaagacatgc 780 
cccatccaga tcaaggtgat gaccccacct cctcagggag ctgttatccg cgccatgcct 84 0 
gtctacaaaa aagctgagca cgtcacggag gtggtgaagc ggtgccccaa ccatgagctg 900 
agccgtgaat tcaacgaggg acagattgcc cctcctagtc atttgattcg agtagagggg 960 
aacagccatg cccagtatgt agaagatccc atcacaggaa gacagagtgt gctggtacct 1020 
tatgagccac cccaggttgg cactgaattc acgacagtct tgtacaattt catgtgtaac 1080 
agcagttgtg ttggagggat gaaccgccgt ccaattttaa tcattgttac tctggaaacc 1140 
agagatgggc aagtcctggg ccgacgctgc tttgaggccc ggatctgtgc ttgcccagga 1200 
agagacagga aggcggatga agatagcatc agaaagcagc aagtttcgga cagtacaaag 1260 
aacggtgatg gtacgaagcg cccgtctcgt cagaacacac atggtatcca gatgacatcc 1320 
atcaagaaac gaagatcccc agatgatgaa ctgttatact taccagtgag gggccgtgag 1380 
acttatgaaa tgctgttgaa gatcaaagag tccctggaac tcatgcagta ccttcctcag 144 0 
cacacaattg aaacgtacag gcaacagcaa cagcagcagc accagcactt acttcagaaa 1500 
cagtgagtgt atcaacgtgt cattttagga ggcatgagtg acggtgactt tatttggatc 1560 
agcaataggg tgattgatga gcaatgtgga acataatggg agatagcaga ttgtcataga 1620 
ttcagatgac ctggtatggc aaccctcttt cagttgcaac cttttttacg tgtcttatta 1680 
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taaccttccc ttcagaattc cacttatgtt 
attacaaaga aactcacact aacagttctc 
cagtaagtac acactctatt tggtagtgat 
catcccaatg gattgtctta taaatctcct 
cactgtagac cagggatagc aaataggctt 
ctgtaatgag aagaattctg agacctagtg 
aaggataagg tagcatcatg ttgccgtatt 

<210> 335 

<211> 4849 

<212> DNA 

<213> Homo sapiens 

<400> 335 

cgttgatatc aaagacagtt gaaggaaatg 
cagtactgcc ctgaccctta catccagcgt 
aaagaaagtt attaccgatc caccatgtcc 
ccagaggttt tccagcatat ctgggatttt 
attgacttga actttgtgga tgaaccatca 
agcatggact gtatccgcat gcaggactcg 
acgaacctgg ggctcctgaa cagcatggac 
agtccctata acacagacca cgcgcagaac 
cccagctcca ccttcgatgc tctctctcca 
ccaggcccgc acagtttcga cgtgtccttc 
tggacgtatt ccactgaact gaagaaactc 
cagatcaagg tgatgacccc acctcctcag 
aaaaaagctg agcacgtcac ggaggtggtg 
gaattcaacg agggacagat tgcccctcct 
catgcccagt atgtagaaga tcccatcaca 
ccaccccagg ttggcactga attcacgaca 
tgtgttggag ggatgaaccg ccgtccaatt 
gggcaagtcc tgggccgacg ctgctttgag 
aggaaggcgg atgaagatag catcagaaag 
gatggtacga agcgcccgtt tcgtcagaac 
aaacgaagat ccccagatga tgaactgtta 
gaaatgctgt tgaagatcaa agagtccctg 
attgaaacgt acaggcaaca gcaacagcag 
tcaatacagt ctccatcttc atatggtaac 
atgaacaagc tgccttctgt gagccagctt 
cctacaacca ttcctgatgg catgggagcc 
atggctggag acatgaatgg actcagcccc 
ccatccacct cccagtgcac acccccacct 
ttcttagcga ggttgggctg ttcatcatgt 
accatctatc agattgagca ttactccatg 
caatttcgac atgcgatctg gaagggcatc 
tccccttctc atctcctgcg gaccccaagc 
gagacccggg gtgagcgtgt tattgatgct 
ttcccacccc gagatgagtg gaatgacttc 
caacagcgca tcaaagagga gggggagtga 
tcctaactgc cagcycccta aaagcactcc 
cctccccttc ctcttgtctg atttcttagg 
taacatctga cctggcatct aattctgatt 
cagaactgta gctgccatgg ctaggtagaa 
agctgcagag atttctcatt gacttttata 
tatatataaa tgtataaata tacagtatag 
taaaatgtaa tttaaatgaa agaaaattga 
ttgttttgga tggcttgtct atactccttc 
atctagagct taatgctaca tgtgagtgac 
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ctgaaattaa atacaaacca tttctggtga 1740 
ttctctatat gcctggtcca tacacactaa 1800 
gtgtatattt gaaaacatga aatcttttct 1860 
gggatgcaca ctatccactt ttgggaataa 1920 
tactataata taaagtgact tgtttgaatg 1980 
catgataatt ggggaaatat ctgggtgcag 2040 
ttagcatctc tg 2082 



aattttgaaa cttcacggtg tgccacccta 60 
ttcgtagaaa ccccagctca tttctcttgg 120 
cagagcacac agacaaatga attcctcagt 180 
ctggaacagc ctatatgttc agttcagccc 24 0 
gaagatggtg cgacaaacaa gattgagatt 300 
gacctgagtg accccatgtg gccacagtac 360 
cagcagattc agaacggctc ctcgtccacc 420 
agcgtcacgg cgccctcgcc ctacgcacag 480 
tcacccgcca tcccctccaa caccgactac 540 
cagcagtcga gcaccgccaa gtcggccacc 600 
tactgccaaa ttgcaaagac atgccccatc 660 
ggagctgtta tccgcgccat gcctgtctac 720 
aagcggtgcc ccaaccatga gctgagccgt 780 
agtcatttga ttcgagtaga ggggaacagc 84 0 
ggaagacaga gtgtgctggt accttatgag 900 
gtcttgtaca atttcatgtg taacagcagt 960 
ttaatcattg ttactctgga aaccagagat 1020 
gcccggatct gtgcttgccc aggaagagac 1080 
cagcaagttt cggacagtac aaagaacggt 1140 
acacatggta tccagatgac atccatcaag 1200 
tacttaccag tgaggggccg tgagacttat 1260 
gaactcatgc agtaccttcc tcagcacaca 1320 
cagcaccagc acttacttca gaaacagacc 1380 
agctccccac ctctgaacaa aatgaacagc 1440 
atcaaccctc agcagcgcaa cgccctcact 1500 
aacattccca tgatgggcac ccacatgcca 1560 
acccaggcac tccctccccc actctccatg 1620 
ccgtatccca cagattgcag cattgtcagt 1680 
ctggactatt tcacgaccca ggggctgacc 1740 
gatgatctgg caagtctgaa aatccctgag 1800 
ctggaccacc ggcagctcca cgaattctcc 18 60 
agtgcctcta cagtcagtgt gggctccagt 1920 
gtgcgattca ccctccgcca gaccatctct 1980 
aactttgaca tggatgctcg ccgcaataag 2040 
gcctcaccat gtgagctctt cctatccctc 2100 
tgcttaatct tcaaagcctt ctccctagct 2160 
ggaaggagaa gtaagaggct acctcttacc 2220 
ctggctttaa gccttcaaaa ctatagcttg 2280 
gtgagcaaaa aagagttggg tgtctcctta 2340 
aagcatgttc acccttatag tctaagacta 2400 
atttttgggt ggggggcatt gagtattgtt 2460 
gttgcactta ttgaccattt tttaatttac 2520 
ccttaagggg tatcatgtat ggtgataggt 2580 
gatgatgtac agattctttc agttctttgg 2640 
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attctaaata catgccacat caaacctttg 
ggtaagactg tagatatgta ttcttttctc 
tcttctagtg atgatggttc acgttggggt 
gtccaaacgt cctctttagt ttttggttgg 
cagccagttc aaaaacaccc gacgtcatgt 
taataccaga taccttatct tacaatattg 
gtattaaaac taaatttcac tactagattg 
taagaattct gattgatttg attgggatga 
tttactgtct attaatattc agggtaaata 
aaacagtaag atatctcaat gaaccataaa 
gataatattg tttggtaaat gtttcttttg 
tattctataa aactctgcat gtagaggctt 
aggagtggtg atttgaaaaa tataaaatta 
atcactgtat cattttcttt tttaaccggt 
gagaacccag tttcccgtcc atctccctta 
cagagcaaga gataagtctt tcatggctgc 
ccttgaaact ttgggaaaac atgttaatga 
catttttttg catgcatgca aatgagctct 
tgtcattgca cataagcttc cattttaatt 
aaaggtaatg tgtggattgc ctctgaaaag 
tttgtatttt gattattttt tttttcttct 
gaaacctttt tttatcgttt ttgtattttc 
tcaaataaga aataatgcta caattttaag 
atttttttaa aattttgtat gttaaagaga 
cttaaatggt aataagcact gtaaacttct 
aaggggaaga atgaaagctg ttccttggtc 
ttgctgaggg tttgaataaa cctaggactt 
ctagggcctt ggaaattcct gtactgtgtc 
acccttactg gcttacctcc tcatggcagc 
gtaaggggta aaaggatagt aagcatagaa 
tgtggcctca gctcaatgca gttagctgaa 
ataaacagaa atggaaagca gagttttcat 
aatcccctaa aataacagta tgtgggatat 
atttttataa ttgtacaaaa ttaagcaaat 
ttcaaaaggt attatacatg tgatacattt 
tttctgttat gggcttttgg ggagccagaa 
gacatgcaat aaaatttaaa aaataaataa 

<210> 336 

<211> 1386 

<212> DNA 

<213> Homo sapiens 



agtagatcca tttccattgc ttattatgta 2700 
agtgttggta tattttatat tactgacatt 27 60 
gatttaatcc agttataaga agaagttcat 2820 
gaatgaggaa aattcttaaa aggcccatag 2880 
atttgagcat atcagtaacc cccttaaatt 2940 
attgggaaaa catttgctgc cattacagag 3000 
actaactcaa atacacattt gctactgttg 3060 
atgccatcta tctagttcta acagtgaagt 3120 
ggaatcattc agaaatgttg agtctgtact 3180 
ttcaactttg taaaaatctt ttgaagcata 3240 
tttggtaaat gtttctttta aagaccctcc 3300 
gtttaccttt ctctctctaa ggtttacaat 3360 
tgagattggt tttcctgtgg cataaattgc 3420 
aagagtttca gtttgttgga aagtaactgt 34 80 
gggactaccc atagacatga aaggtcccca 3540 
tgttgcttaa accacttaaa cgaagagttc 3600 
caatattcca gatctttcag aaatataaca 3660 
gaaatcttcc catgcattct ggtcaagggc 3720 
ttaaagtgca aaagggccag cgtggctcta 3780 
tgtgtatata ttttgtgtga aattgcatac 3840 
tgggatagtg ggatttccag aaccacactt 3900 
atgaaaatac catttagtaa gaataccaca 3960 
aggggaggga agggaaagtt tttttttatt 4020 
atgagtcctt gatttcaaag ttttgttgta 4080 
gcaacaagca tgcagctttg caaacccatt 4140 
ctagtaagaa gacaaactgc ttcccttact 4200 
ccgagctatg tcagtactat tcaggtaaca 4260 
tcatggattt ggcactagcc aaagcgaggc 4320 
ctactctcct tgagtgtatg agtagccagg 4380 
accactagaa agtgggctta atggagttct 4440 
gaattgaaaa gtttttgttt ggagacgttt 4500 
taaatccttt tacctttttt ttttcttggt 4560 
tgaatgttaa agggatattt tttttctatt 4620 
gttaaaagtt ttatatgctt tattaatgtt 4680 
tttaagcttc agttgcttgt cttctggtac 4740 
gccaatctac aatctctttt tgtttgccag 4800 
aaactaatta agaaataaa 4849 



<400> 336 

atgttgtacc tggaaaacaa tgcccagact 
gggctcctga acagcatgga ccagcagatt 
aacacagacc- acgcgcagaa cagcgtcacg 
accttcgatg ctctctctcc atcacccgcc 
cacagtttcg acgtgtcctt ccagcagtcg 
tccactgaac tgaagaaact ctactgccaa 
gtgatgaccc cacctcctca gggagctgtt 
gagcacgtca cggaggtggt gaagcggtgc 
gagggacaga ttgcccctcc tagtcatttg 
tatgtagaag atcccatcac aggaagacag 
gttggcactg aattcacgac agtcttgtac 
gggatgaacc gccgtccaat tttaatcatt 
ctgggccgac gctgctttga ggcccggatc 
gatgaagata gcatcagaaa gcagcaagtt 



caatttagtg agccacagta cacgaacctg 60 
cagaacggct cctcgtccac cagtccctat 120 
gcgccctcgc cctacgcaca gcccagctcc 180 
atcccctcca acaccgacta cccaggcccg 240 
agcaccgcca agtcggccac ctggacgtat 300 
attgcaaaga catgccccat ccagatcaag 360 
atccgcgcca tgcctgtcta caaaaaagct 420 
cccaaccatg agctgagccg tgaattcaac 480 
attcgagtag aggggaacag ccatgcccag 540 
agtgtgctgg taccttatga gccaccccag 600 
aatttcatgt gtaacagcag ttgtgttgga 660 
gttactctgg aaaccagaga tgggcaagtc 720 
tgtgcttgcc caggaagaga caggaaggcg 780 
tcggacagta caaagaacgg tgatggtacg 840 
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aagcgcccgt ttcgtcagaa cacacatggt 
tccccagatg atgaactgtt atacttacca 
ttgaagatca aagagtccct ggaactcatg 
tacaggcaac agcaacagca gcagcaccag 
tctccatctt catatggtaa cagctcccca 
ctgccttctg tgagccagct tatcaaccct 
attcctgatg gcatgggagc caacattccc 
gacatgaatg gactcagccc cacccaggca 
tcccactgca cacccccacc tccgtatccc 
gtctga 

<210> 337 
<211> 1551 
<212> DNA 

<213> Homo sapiens 



atccagatga catccatcaa gaaacgaaga 900 

gtgaggggcc gtgagactta tgaaatgctg 960 

cagtaccttc ctcagcacac aattgaaacg 1020 

cacttacttc agaaacagac ctcaatacag 1080 

cctctgaaca aaatgaacag catgaacaag 1140 

cagcagcgca acgccctcac tcctacaacc 12 00 

atgatgggca cccacatgcc aatggctgga 1260 

ctccctcccc cactctccat gccatccacc 1320 

acagattgca gcattgtcag gatctggcaa 1380 

1386 



<400> 337 

atgtcccaga gcacacagac aaatgaattc ctcagtccag aggttttcca gcatatctgg 60 

gattttctgg aacagcctat atgttcagtt cagcccattg acttgaactt tgtggatgaa 120 

ccatcagaag atggtgcgac aaacaagatt gagattagca tggactgtat ccgcatgcag 180 

gactcggacc tgagtgaccc catgtggcca cagtacacga acctggggct cctgaacagc 240 

atggaccagc agattcagaa cggctcctcg tccaccagtc cctataacac agaccacgcg 300 

cagaacagcg tcacggcgcc ctcgccctac gcacagccca gctccacctt cgatgctctc 360 

tctccatcac ccgccatccc ctccaacacc gactacccag gcccgcacag tttcgacgtg 420 

tccttccagc agtcgagcac cgccaagtcg gccacctgga cgtattccac tgaactgaag 480 

aaactctact gccaaattgc aaagacatgc cccatccaga tcaaggtgat gaccccacct 540 

cctcagggag ctgttatccg cgccatgcct gtctacaaaa aagctgagca cgtcacggag 600 

gtggtgaagc ggtgccccaa ccatgagctg agccgtgaat tcaacgaggg acagattgcc 660 

cctcctagtc atttgattcg agtagagggg aacagccatg cccagtatgt agaagatccc 720 

atcacaggaa gacagagtgt gctggtacct tatgagccac cccaggttgg cactgaattc 780 

acgacagtct tgtacaattt catgtgtaac agcagttgtg ttggagggat gaaccgccgt 840 

ccaattttaa tcattgttac tctggaaacc agagatgggc aagtcctggg ccgacgctgc 900 

tttgaggccc ggatctgtgc ttgcccagga agagacagga aggcggatga agatagcatc 960 

agaaagcagc aagtttcgga cagtacaaag aacggtgatg gtacgaagcg cccgtttcgt 1020 

cagaacacac atggtatcca gatgacatcc atcaagaaac gaagatcccc agatgatgaa 1080 

ctgttatact taccagtgag gggccgtgag acttatgaaa tgctgttgaa gatcaaagag 1140 

tccctggaac tcatgcagta ccttcctcag cacacaattg aaacgtacag gcaacagcaa 1200 

cagcagcagc accagcactt acttcagaaa cagacctcaa tacagtctcc atcttcatat 12 60 

ggtaacagct ccccacctct gaacaaaatg aacagcatga acaagctgcc ttctgtgagc 1320 

cagcttatca accctcagca gcgcaacgcc ctcactccta caaccattcc tgatggcatg 1380 

ggagccaaca ttcccatgat gggcacccac atgccaatgg ctggagacat gaatggactc 1440 

agccccaccc aggcactccc tcccccactc tccatgccat ccacctccca ctgcacaccc 1500 

ccacctccgt atcccacaga ttgcagcatt gtcaggatct ggcaagtctg a 1551 

<210> 338 
<211> 586 
<212> PRT 

<213> Homo sapiens 
<400> 338 

Met Leu Tyr Leu Glu Asn Asn Ala Gin Thr Gin Phe Ser Glu Pro Gin 

15 10 15 

Tyr Thr Asn Leu Gly Leu Leu Asn Ser Met Asp Gin Gin lie Arg Asn 

20 25 30 

Gly Ser Ser Ser Thr Ser Pro Tyr Asn Thr Asp His Ala Gin Asn Ser 

35 40 45 

Val Thr Ala Pro Ser Pro Tyr Ala Gin Pro Ser Pro Thr Phe Asp Ala 
50 55 60 
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Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 


Asn 


Thr 


Asp 


Tyr 


Pro 


Gly 


Pro 


65 










70 










75 










80 


His 


Ser 


Ser 


Asp 


Val 


Ser 


Phe 


Gin 


Gin 


Ser 


Ser 


Thr 


Ala 


Lys 


Ser 


Ala 










85 










90 










95 




Thr 


Trp 


Thr 


Tyr 


Ser 


Thr 


Glu 


Leu 


Lys 


Lys 


Leu 


Tyr 


Cys 


Gin 


He 


Ala 








100 










105 










110 






Lys 


Thr 


Cys 


Pro 


He 


Gin 


He 


Lys 


Val 


Met 


Thr 


Pro 


Pro 


Pro 


Gin 


Gly 






115 










120 










125 








Ala 


Val 


He 


Arg 


Ala 


Met 


Pro 


Val 


Tyr 


Lys 


Lys 


Ala 


Glu 


His 


Val 


Thr 




130 










135 










140 










Glu 


Val 


Val 


Lys 


Arg 


Cys 


Pro 


Asn 


His 


Glu 


Leu 


Ser 


Arg 


Glu 


Phe 


Asn 


145 










150 










155 










lbu 


Glu 


Gly 


Gin 


He 


Ala 


Pro 


Pro 


Ser 


His 


Leu 


lie 


Arg 


Val 


Glu 


Gly 


Asn 










165 










170 










1/5 




Ser 


His 


Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 


He 


Thr 


Gly 


Arg 


Gin 


Ser 


Val 








180 










185 










190 






Leu 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val 


Gly 


Thr 


Glu 


Phe 


Thr 


Thr 


Val 






195 










200 










205 








Leu 


Tyr 


Asn 


Phe 


Met 


Cys 


Asn 


Ser 


Ser 


Cys 


Val 


Gly 


Gly 


Met 


Asn 


Arg 




210 










215 










220 










Arg 


Pro 


He 


Leu 


He 


lie 


Val 


Thr 


Leu 


Glu 


Thr 


Arg 


Asp 


Gly 


Gin 


Val 


225 










230 










235 










240 


Leu 


Gly 


Arg 


Arg 


Cys 


Phe 


Glu 


Ala 


Arg 


He 


Cys 


Ala 


Cys 


Pro 


Gly 


Arg 










245 










250 










255 




Asp 


Arg 


Lys 


Ala 


Asp 


Glu 


Asp 


Ser 


He 


Arg 


Lys 


Gin 


Gin 


Val 


Ser 


Asp 








260 










265 










270 






Ser 


Thr 


Lys 


Asn 


Gly 


Asp 


Gly 


Thr 


Lys 


Arg 


Pro 


Phe 


Arg 


Gin 


Asn 


Thr 






275 










280 










285 








His 


Gly 


He 


Gin 


Met 


Thr 


Ser 


He 


Lys 


Lys 


Arg 


Arg 


Ser 


Pro 


Asp 


Asp 




290 










295 










300 










Glu 


Leu 


Leu 


Tyr 


Leu 


Pro 


Val 


Arg 


Gly 


Arg 


Glu 


Thr 


Tyr 


Glu 


Met 


Leu 


305 










310 










315 










320 


Leu 


Lys 


He 


Lys 


Glu 


Ser 


Leu 


Glu 


Leu 


Met 


Gin 


Tyr 


Leu 


Pro 


Gin 


His 










325 










330 










335 




Thr 


He 


Glu 


Thr 


Tyr 


Arg 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


His 


Gin 


His 


Leu 








340 










345 










350 






Leu 


Gin 


Lys 


Gin 


Thr 


Ser 


He 


Gin 


Ser 


Pro 


Ser 


Ser 


Tyr 


Gly 


Asn 


Ser 






355 










360 










365 








Ser 


Pro 


Pro 


Leu 


Asn 


Lys 


Met 


Asn 


Ser 


Met 


Asn 


Lys 


Leu 


Pro 


Ser 


Val 




370 








375 










380 










Ser 


Gin 


Leu 


He 


Asn 


Pro 


Gin 


Gin 


Arg 


Asn 


Ala 


Leu 


Thr 


Pro 


Thr 


Thr 


385 










390 










395 










400 


lie 


Pro 


Asp 


Gly 


Met 


Gly 


Ala 


Asn 


He 


Pro 


Met 


Met 


Gly 


Thr 


His 


Met 








405 










410 










415 




Pro 


Met 


Ala 


Gly 


Asp 


Met 


Asn 


Gly 


Leu 


Ser 


Pro 


Thr 


Gin 


Ala 


Leu 


Pro 








420 










425 










430 






Pro 


Pro 


Leu 


Ser 


Met 


Pro 


Ser 


Thr 


Ser 


His 


Cys 


Thr 


Pro 


Pro 


Pro 


Pro 






435 










440 










445 








Tyr 


Pro 


Thr 


Asp 


Cys 


Ser 


He 


Val 


Ser 


Phe 


Leu 


Ala 


Arg 


Leu 


Gly 


Cys 




450 










455 










460 










Ser 


Ser 


Cys 


Leu 


Asp 


Tyr 


Phe 


Thr 


Thr 


Gin 


Gly 


Leu 


Thr 


Thr 


He 


Tyr 


465 










470 










475 










480 


Gin 


He 


Glu 


His 


Tyr 


Ser 


Met 


Asp 


Asp 


Leu 


Ala 


Ser 


Leu 


Lys 


He 


Pro 










485 










490 










495 




Glu 


Gin 


Phe 


Arg 


His 


Ala 


He 


Trp 


Lys 


Gly 


He 


Leu 


Asp 


His 


Arg 


Gin 








500 










505 










510 






Leu 


His 


Glu 


Phe 


Ser 


Ser 


Pro 


Ser 


His 


Leu 


Leu 


Arg 


Thr 


Pro 


Ser 


Ser 






515 










520 










525 
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Ala Ser Thr Val Ser Val Gly Ser 

530 535 
He Asp Ala Val Arg Phe Thr Leu 
545 550 
Arg Asp Glu Trp Asn Asp Phe Asn 
565 

Lys Gin Gin Arg He Lys Glu Glu 
580 



Ser Glu Thr Arg Gly Glu Arg Val 
540 

Arg Gin Thr He Ser Phe Pro Pro 
555 560 
Phe Asp Met Asp Ala Arg Arg Asn 

570 575 
Gly Glu 
585 



<210> 339 
<211> 641 
<212> PRT 

<213> Homo sapiens 
<400> 339 

Met Ser Gin Ser Thr Gin Thr Asn Glu Phe Leu Ser Pro Glu Val Phe 

15 10 15 

Gin His He Trp Asp Phe Leu Glu Gin Pro He Cys Ser Val Gin Pro 

20 25 30 

He Asp Leu Asn Phe Val Asp Glu Pro Ser Glu Asp Gly Ala Thr Asn 

35 40 45 

Lys He Glu He Ser Met Asp Cys He Arg Met Gin Asp Ser Asp Leu 

50 55 60 

Ser Asp Pro Met Trp Pro Gin Tyr Thr Asn Leu Gly Leu Leu Asn Ser 
65 70 75 80 

Met Asp Gin Gin He Gin Asn Gly Ser Ser Ser Thr Ser Pro Tyr Asn 

85 90 95 

Thr Asp His Ala Gin Asn Ser Val Thr Ala Pro Ser Pro Tyr Ala Gin 

100 105 110 

Pro Ser Ser Thr Phe Asp Ala Leu Ser Pro Ser Pro Ala He Pro Ser 

115 120 125 

Asn Thr Asp Tyr Pro Gly Pro His Ser Phe Asp Val Ser Phe Gin Gin 

130 ~* 135 140 

Ser Ser Thr Ala Lys Ser Ala Thr Trp Thr Tyr Ser Thr Glu Leu Lys 
145 150 155 160 

Lys Leu Tyr Cys Gin He Ala Lys Thr Cys Pro He Gin He Lys Val 

165 170 175 

Met Thr Pro Pro Pro Gin Gly Ala Val He Arg Ala Met Pro Val Tyr 

180 185 190 

Lys Lys Ala Glu His Val Thr Glu Val Val Lys Arg Cys Pro Asn His 

195 200 205 

Glu Leu Ser Arg Glu Phe Asn Glu Gly Gin He Ala Pro Pro Ser His 

210 215 220 

Leu He Arg Val Glu Gly Asn Ser His Ala Gin Tyr Val Glu Asp Pro 
225 230 235 240 

He Thr Gly Arg Gin Ser Val Leu Val Pro Tyr Glu Pro Pro Gin Val 

245 250 255 

Gly Thr Glu Phe Thr Thr Val Leu Tyr Asn Phe Met Cys Asn Ser Ser 

260 265 270 

Cys Val Gly Gly Met Asn Arg Arg Pro He Leu He He Val Thr Leu 

275 280 285 

Glu Thr Arg Asp Gly Gin Val Leu Gly Arg Arg Cys Phe Glu Ala Arg 

290 ~ 295 300 

He Cys Ala Cys Pro Gly Arg Asp Arg Lys Ala Asp Glu Asp Ser He 
305 . 310 315 320 

Arg Lys Gin Gin Val Ser Asp Ser Thr Lys Asn Gly Asp Gly Thr Lys 
325 330 335 
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Arg 


Pro 


Phe Arg 


Gin Asn Thr 


His 


Gly He 


Gin 


Met Thr Ser He Lys 






340 






345 




350 


Lys 


Arg 


Arg Ser 


Pro Asp Asp Glu Leu Leu 


Tyr 


Leu Pro Val Arg Gly 






355 




360 






365 


Arg 


Glu 


Thr Tyr 


Glu Met Leu 


Leu 


Lys He 


Lys 


Glu Ser Leu Glu Leu 




370 




375 








380 


Met 


Gin 


Tyr Leu 


Pro Gin His 


Thr 


lie Glu 


Thr 


Tyr Arg Gin Gin Gin 


385 






390 






395 


400 


Gin 


Gin 


Gin His 


Gin His Leu Leu Gin Lys 


Gin 


Thr Ser He Gin Ser 








405 




410 




415 


Pro 


Ser 


Ser Tyr 


Gly Asn Ser 


Ser 


Pro Pro 


Leu 


Asn Lys Met Asn Ser 






420 






425 




430 


Met 


Asn 


Lys Leu 


Pro Ser Val 


Ser 


Gin Leu 


He 


Asn Pro Gin Gin Arg 






435 




440 






445 


Asn 


Ala 


Leu Thr 


Pro Thr Thr 


He 


Pro Asp 


Gly 


Met Gly Ala Asn He 




450 




455 








460 


Pro 


Met 


Met Gly 


Thr His Met 


Pro 


Met Ala 


Gly 


Asp Met Asn Gly Leu 


465 






470 






475 


480 


Ser 


Pro 


Thr Gin 


Ala Leu Pro 


Pro 


Pro Leu 


Ser 


Met Pro Ser Thr Ser 








485 




490 




495 


His 


Cys 


Thr Pro 


Pro Pro Pro 


Tyr 


Pro Thr 


Asp 


Cys Ser He Val Gly 






500 






505 




510 


Phe 


Leu 


Ala Arg 


Leu Gly Cys 


Ser Ser Cys 


Leu 


Asp Tyr Phe Thr Thr 






515 




520 






525 


Gin 


Gly 


Leu Thr 


Thr He Tyr Gin He Glu 


His 


Tyr Ser Met Asp Asp 




530 




535 








540 


Leu 


Ala 


Ser Leu 


Lys He Pro 


Glu 


Gin Phe 


Arg 


His Ala He Trp Lys 


545 






550 






555 


560 


Gly 


He 


Leu Asp 


His Arg Gin Leu His Glu 


Phe 


Ser Ser Pro Ser His 








565 




570 




575 


Leu 


Leu 


Arg Thr 


Pro Ser Ser 


Ala 


Ser Thr 


Val 


Ser Val Gly Ser Ser 






580 






585 




590 


Glu 


Thr 


Arg Gly 


Glu Arg Val 


He Asp Ala 


Val 


Arg Phe Thr Leu Arg 






595 




600 






605 


Gin 


Thr 


He Ser 


Phe Pro Pro Arg Asp Glu 


Trp 


Asn Asp Phe Asn Phe 




610 




615 








620 


Asp 


Met 


Asp Ala 


Arg Arg Asn Lys 


Gin Gin 


Arg 


He Lys Glu Glu Gly 


625 






630 






635 


640 


Glu 

















<210> 340 
<211> 448 
<212> PRT 

<213> Homo sapiens 
<400> 340 

Met Ser Gin Ser Thr Gin Thr Asn 

1 5 
Gin His He Trp Asp Phe Leu Glu 
20 

He Asp Leu Asn Phe Val Asp Glu 

35 40 
Lys He Glu He Ser Met Asp Cys 

50 55 
Ser Asp Pro Met Trp Pro Gin Tyr 
65 70 



Glu 


Phe 


Leu 


Ser Pro Glu Val Phe 




10 




15 


Gin 


Pro 


He 


Cys Ser Val Gin Pro 


25 






30 


Pro 


Ser 


Glu 


Asp Gly Ala Thr Asn 








45 


He Arg 


Met 


Gin Asp Ser Asp Leu 








60 


Thr 


Asn 


Leu 


Gly Leu Leu Asn Ser 






75 


80 



WO 02/47534 



147 



PCT/US01/47576 



Met Asp Gin Gin lie Gin Asn Gly Ser Ser Ser Thr Ser Pro Tyr Asn 

85 90 95 

Thr Asp His Ala Gin Asn Ser Val Thr Ala Pro Ser Pro Tyr Ala Gin 

100 ( 105 HO 

Pro Ser Ser Thr Phe Asp Ala Leu Ser Pro Ser Pro Ala He Pro Ser 

115 120 125 

Asn Thr Asp Tyr Pro Gly Pro His Ser Phe Asp Val Ser Phe Gin Gin 

130 135 140 

Ser Ser Thr Ala Lys Ser Ala Thr Trp Thr Tyr Ser Thr Glu Leu Lys 
145 150 155 160 

Lys Leu Tyr Cys Gin He Ala Lys Thr Cys Pro He Gin He Lys Val 

165 170 175 

Met Thr Pro Pro Pro Gin Gly Ala Val He Arg Ala Met Pro Val Tyr 

180 185 190 

Lys Lys Ala Glu His Val Thr Glu Val Val Lys Arg Cys Pro Asn His 

195 200 205 

Glu Leu Ser Arg Glu Phe Asn Glu Gly Gin He Ala Pro Pro Ser His 

210 215 220 

Leu He Arg Val Glu Gly Asn Ser His Ala Gin Tyr Val Glu Asp Pro 
225 230 235 240 

He Thr Gly Arg Gin Ser Val Leu Val Pro Tyr Glu Pro Pro Gin Val 

245 250 255 

Gly Thr Glu Phe Thr Thr Val Leu Tyr Asn Phe Met Cys Asn Ser Ser 

260 265 270 

Cys Val Gly Gly Met Asn Arg Arg Pro He Leu He He Val Thr Leu 

275 280 285 

Glu Thr Arg Asp Gly Gin Val Leu Gly Arg Arg Cys Phe Glu Ala Arg 

290 ~ 295 300 

He Cys Ala Cys Pro Gly Arg Asp Arg Lys Ala Asp Glu Asp Ser He 
305 310 315 320 

Arg Lys Gin Gin Val Ser Asp Ser Thr Lys Asn Gly Asp Gly Thr Lys 

325 330 335 

Arg Pro Phe Arg Gin Asn Thr His Gly He Gin Met Thr Ser He Lys 

340 345 350 

Lys Arg Arg Ser Pro Asp Asp Glu Leu Leu Tyr Leu Pro Val Arg Gly 

355 360 365 

Arg Glu Thr Tyr Glu Met Leu Leu Lys He Lys Glu Ser Leu Glu Leu 

370 375 380 

Met Gin Tyr Leu Pro Gin His Thr He Glu Thr Tyr Arg Gin Gin Gin 
385 390 395 400 

Gin Gin Gin His Gin His Leu Leu Gin Lys His Leu Leu Ser Ala Cys 

405 410 415 

Phe Arg Asn Glu Leu Val Glu Pro Arg Arg Glu Thr Pro Lys Gin Ser 

420 425 430 

Asp Val Phe Phe Arg His Ser Lys Pro Pro Asn Arg Ser Val Tyr Pro 
435 440 445 



<210> 341 
<211> 356 
<212> PRT 

<213> Homo sapiens 
<400> 341 

Met Leu Tyr Leu Glu Asn Asn Ala Gin Thr Gin Phe Ser Glu Pro Gin 

15 10 15 

Tyr Thr Asn Leu Gly Leu Leu Asn Ser Met Asp Gin Gin He Gin Asn 
20 " 25 30 
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Gly 


Ser 


Ser 


Ser 


Thr 


Ser 


Pro Tyr Asn 


Thr 


Asp 


His 


Ala 


Gin 


Asn 


Ser 






35 








40 








45 








Val 


Thr 


Ala 


Pro 


Ser 


Pro 


Tyr Ala Gin 


Pro 


Ser 


Ser 


Thr 


Phe 


Asp 


Ala 




50 










55 






60 










Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He Pro Ser 


Asn 


Thr 


Asp 


Tyr 


Pro 


Gly 


Pro 


65 










70 






75 










80 


His 


Ser 


Phe 


Asp 


Val 


Ser 


Phe Gin Gin 


Ser 


Ser 


Thr 


Ala 


Lys 


Ser 


Ala 










85 






90 










95 




Thr 


Trp 


Thr 


Tyr 


Ser 


Thr 


Glu Leu Lys 


Lys 


Leu 


Tyr 


Cys 


Gin 


He 


Ala 








100 






105 










110 






Lys 


Thr 


Cys 


Pro 


lie 


Gin 


He Lys Val 


Met 


Thr 


Pro 


Pro 


Pro 


Gin 


Gly 






115 








120 








125 








Ala 


Val 


He 


Arg 


Ala 


Met 


Pro Val Tyr 


Lys 


Lys 


Ala 


Glu 


His 


Val 


Thr 




130 










135 






140 










Glu 


Val 


Val 


Lys 


Arg 


Cys 


Pro Asn His 


Glu 


Leu 


Ser 


Arg 


Glu 


Phe 


Asn 


145 










150 






155 










loU 


Glu 


Gly 


Gin 


He 


Ala 


Pro 


Pro Ser His 


Leu 


He 


Arg 


Val 


Glu 


Gly 


Asn 










165 






170 










175 




Ser 


His 


Ala 


Gin 


Tyr 


Val 


Glu Asp Pro 


He 


Thr 


Gly 


Arg 


Gin 


Ser 


Val 








180 






185 










190 






Leu 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro Gin Val 


Gly 


Thr 


Glu 


Phe 


Thr 


Thr 


Val 






195 






200 








205 








Leu 


Tyr 


Asn 


Phe 


Met Cys 


Asn Ser Ser 


Cys 


Val 


Gly 


Gly 


Met 


Asn 


Arg 




210 










215 






220 










Arg 


Pro 


He 


Leu 


He 


He 


Val Thr Leu 


Glu 


Thr 


Arg 


Asp 


Gly 


Gin 


Val 


225 










230 






235 










240 


Leu 


Gly 


Arg 


Arg 


Cys 


Phe 


Glu Ala Arg 


He 


Cys 


Ala 


Cys 


Pro 


Gly 


Arg 










245 






250 










255 




Asp 


Arg 


Lys 


Ala 


Asp 


Glu 


Asp Ser He 


Arg 


Lys 


Gin 


Gin 


Val 


Ser 


Asp 








ZdU 






ZbD 










OTA 






Ser 


Thr 


Lys 


Asn 


Gly Asp 


Gly Thr Lys 


Arg 


Pro 


Ser 


Arg 


Gin 


Asn 


Thr 






275 








280 








285 








His 


Gly 


He 


Gin 


Met 


Thr 


Ser He Lys 


Lys 


Arg 


Arg 


Ser 


Pro 


Asp 


Asp 




290 










295 






300 










Glu 


Leu 


Leu 


Tyr 


Leu 


Pro 


Val Arg Gly 


Arg 


Glu 


Thr 


Tyr 


Glu 


Met 


Leu 


305 










310 






315 










320 


Leu 


Lys 


He 


Lys 


Glu 


Ser 


Leu Glu Leu 


Met 


Gin 


Tyr 


Leu 


Pro 


Gin 


His 








325 






330 










335 




Thr 


He 


Glu 


Thr 


Tyr Arg 


Gin Gin Gin 


Gin 


Gin 


Gin 


His 


Gin 


His 


Leu 








340 






345 










350 






Leu 


Gin 


Lys 


Gin 























355 



<210> 342 
<211> 680 
<212> PRT 

<213> Homo sapiens 
<400> 342 



Met Asn Phe 


Glu Thr Ser 


Arg Cys 


Ala 


Thr 


Leu 


Gin 


Tyr 


Cys 


Pro 


Asp 


1 


5 






10 










15 




Pro Tyr He 


Gin Arg Phe 


Val Glu 


Thr 


Pro 


Ala 


His 


Phe 


Ser 


Trp 


Lys 




20 




25 










30 






Glu Ser Tyr 


Tyr Arg Ser 


Thr Met 


Ser 


Gin 


Ser 


Thr 


Gin 


Thr 


Asn 


Glu 


35 




40 










45 








Phe Leu Ser 


Pro Glu Val 


Phe Gin 


His 


He 


Trp 


Asp 


Phe 


Leu 


Glu 


Gin 


50 




55 








60 
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Pro He Cys Ser Val Gin Pro He Asp Leu Asn Phe Val Asp Glu Pro 
65 ' 70 75 80 

Ser Glu Asp 'Gly Ala Thr Asn Lys He Glu He Ser Met Asp Cys He 

85 90 95 

Arg Met Gin Asp Ser Asp Leu Ser Asp Pro Met Trp Pro Gin Tyr Thr 

100 105 110 

Asn Leu Gly Leu Leu Asn Ser Met Asp Gin Gin He Gin Asn Gly Ser 

115 120 125 

Ser Ser Thr Ser Pro Tyr Asn Thr Asp His Ala Gin Asn Ser Val Thr 

130 135 140 

Ala Pro Ser Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp Ala Leu Ser 
145 150 155 160 

Pro Ser Pro Ala He Pro Ser Asn Thr Asp Tyr Pro Gly Pro His Ser 

165 170 175 

Phe Asp Val Ser Phe Gin Gin Ser Ser Thr Ala Lys Ser Ala Thr Trp 

180 185 190 

Thr Tyr Ser Thr Glu Leu Lys Lys Leu Tyr Cys Gin He Ala Lys Thr 

195 200 205 

Cys Pro He Gin He Lys Val Met Thr Pro Pro Pro Gin Gly Ala Val 

210 215 220 

He Arg Ala Met Pro Val Tyr Lys Lys Ala Glu His Val Thr Glu Val 
225 230 235 240- 

Val Lys Arg Cys Pro Asn His Glu Leu Ser Arg Glu Phe Asn Glu Gly 

245 250 255 

Gin He Ala Pro Pro Ser His Leu He Arg Val Glu Gly Asn Ser His 

260 265 270 

Ala Gin Tyr Val Glu Asp Pro He Thr Gly Arg Gin Ser Val Leu Val 

275 280 285 

Pro Tyr Glu Pro Pro Gin Val Gly Thr Glu Phe Thr Thr Val Leu Tyr 

290 295 300 

Asn Phe Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg Arg Pro 
305 310 315 320 

He Leu He He Val Thr Leu Glu Thr Arg Asp Gly Gin Val Leu Gly 

325 330 335 

Arg Arg Cys Phe Glu Ala Arg He Cys Ala Cys Pro Gly Arg Asp Arg 

340 345 350 

Lys Ala Asp Glu Asp Ser He Arg Lys Gin Gin Val Ser Asp Ser Thr 

355 360 365 

Lys Asn Gly Asp Gly Thr Lys Arg Pro Phe Arg Gin Asn Thr His Gly 

370 375 380 

He Gin Met Thr Ser He Lys Lys Arg Arg Ser Pro Asp Asp Glu Leu 
385 390 395 400 

Leu Tyr Leu Pro Val Arg Gly Arg Glu Thr Tyr Glu Met Leu Leu Lys 

405 410 415 

lie Lys Glu Ser Leu Glu Leu Met Gin Tyr Leu Pro Gin His Thr He 

420 425 430 

Glu Thr Tyr Arg Gin Gin Gin Gin Gin Gin His Gin His Leu Leu Gin 

435 440 445 

Lys Gin Thr Ser He Gin Ser Pro Ser Ser Tyr Gly Asn Ser Ser Pro 

450 455 460 

Pro Leu Asn Lys Met Asn Ser Met Asn Lys Leu Pro Ser Val Ser Gin 
465 J 470 475 480 

Leu He Asn Pro Gin Gin Arg Asn Ala Leu Thr Pro Thr Thr He Pro 

485 490 495 

Asp Gly Met Gly Ala Asn He Pro Met Met Gly Thr His Met Pro Met 

500 505 510 

Ala Gly Asp Met Asn Gly Leu Ser Pro Thr Gin Ala Leu Pro Pro Pro 
515 520 525 
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Leu 


Ser 


Met 


Pro 


Ser 


Thr Ser 


Gin 




530 








535 




Thr 


Asp 


Cys 


Ser 


lie 


Val Ser 


Phe 


545 


- 








550 




Cys 


Leu 


Asp 


Tyr 


Phe 


Thr Thr 


Gin 










565 






Glu 


His 


Tyr 


Ser 


Met Asp Asp 


Leu 








580 








Phe 


Arg 


His 


Ala 


He 


Trp Lys 


Gly 






595 








600 


Glu 


Phe 


Ser 


Ser 


Pro 


Ser His 


Leu 




610 








615 




Thr 


Val 


Ser 


Val 


Gly 


Ser Ser 


Glu 


625 










630 




Ala 


Val 


Arg 


Phe 


Thr 


Leu Arg 


Gin 










645 






Glu 


Trp 


Asn 


Asp 


Phe 


Asn Phe 


Asp 








660 








Gin 


Arg 


He 


Lys 


Glu Glu Gly 


Glu 






675 








680 



<210> 343 
<211> 461 
<212> PRT 

<213> Homo sapiens 



<400> 343 



Met 


Leu 


Tyr 


Leu 


Glu Asn 


Asn 


Ala 


1 








5 






Tyr 


Thr 


Asn 


Leu 


Gly Leu Leu 


Asn 








20 








Gly 


Ser 


Ser 


Ser 


Thr Ser 


Pro 


Tyr 






35 








40 


Val 


Thr 


Ala 


Pro 


Ser Pro Tyr 


Ala 




50 








55 




Leu 


Ser 


Pro 


Ser 


Pro Ala 


He 


Pro 


65 








70 






His 


Ser 


Phe 


Asp 


Val Ser 


Phe 


Gin 










85 






Thr 


Trp 


Thr 


Tyr 


Ser Thr 


Glu 


Leu 








100 








Lys 


Thr 


Cys 


Pro 


He Gin 


He 


Lys 






115 








120 


Ala 


Val 


He 


Arg 


Ala Met 


Pro 


Val 




130 








.135 




Glu 


Val 


Val 


Lys 


Arg Cys 


Pro 


Asn 


145 








150 






Glu 


Gly 


Gin 


He 


Ala Pro 


Pro 


Ser 










165 






Ser 


His 


Ala 


Gin 


Tyr Val Glu 


Asp 








180 








Leu 


Val 


Pro 


Tyr 


Glu Pro 


Pro 


Gin 






195 








200 


Leu 


Tyr 


Asn 


Phe 


Met Cys Asn 


Ser 




210 








215 




Arg 


Pro 


He 


Leu 


He He 


Val 


Thr 


225 








230 























Cys 


Thr 


Pro 


Pro 


Pro 


Pro 


Tyr 


Pro 








C A A 










T All 

Leu 


Ala 


Arg 


Leu 


Gly 


Cys 


Ser 


ber 






boo 










ccn 


Gly 


Leu 


Thr 


Thr 


lie 


Tyr 


Gin 


lie 




0 /U 










DID 




Ala 


Ser 


Leu 


Lys 


He 


Pro 


Glu 


Gin 


c o c 
585 










con 






lie 


Leu 


Asp 


His 


Arg 


Gin 


Leu 


HIS 


















Leu 


Arg 


Thr 


Pro 


Ser 


Ser 


Ala 


Ser 








620 










Thr 


Arg 


Gly 


Glu 


Arg 


Val 


He 


Asp 






635 










640 


Thr 


lie 


Ser 


Phe 


Pro 


Pro 


Arg 


Asp 














(TCC 

DDj 




Met 


Asp 


Ala 


Arg 


Arg 


Asn 


Lys 


Gin 


665 










670 






bin 


inr 


bin 


rne 




blU 


Pro 


bin 














10 




Ser 


Met 


Asp 


Gin 


Gin 


Tl ~ 

lie 


Gin 


Asn 


o c 










*an 
ou 






Asn 


inr 


Asp 


nlS 


Aia 


bin 


Asn 


□er 










ID 








Gin 


Pro 


Ser 


Ser 


Thr 


pne 


Asp 


Ala 








OU 










Ser 


Asn 


Thr 


Asp 


Tyr 


Pro 


Gly 


Pro 






/ o 










o u 


«in 


oer 


Ser 


inr 


Til a 

Aia 


Lys 


Dei 


HI a 














QC 




Lys 


Lys 


T All 

Leu 


Tyr 


Cys 


bin 


Tl- 
ilS 


Aid 


1UO 










1 1 n 






vai 


Met 


Thr 


Pro 


Pro 


Pro 


bin 


biy 










IOC 








Tyr 


Lys 


Lys 


Ala 


Glu 


His 


val 


Thr 








140 










His 


Glu 


Leu 


Ser 


Arg 


Glu 


Phe 


Asn 






ICC 










i ou 


His 


Leu 


He 


Arg 


Val 


Glu 


Gly 


Asn 




1/0 










1 / D 




Pro 


He 


Thr 


Gly 


Arg 


Gin 


Ser 


Val 


185 










190 






Val 


Gly 


Thr 


Glu 


Phe 


Thr 


Thr 


Val 










205 








Ser 


Cys 


Val 


Gly 


Gly 


Met 


Asn 


Arg 








220 










Leu 


Glu 


Thr 


Arg 


Asp 


Gly 


Gin 


Val 






235 










240 
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Leu Gly 

Asp Arg 

Ser Thr 

His Gly 
290 
Glu Leu 
305 

Leu Lys 

Thr He 

Leu Gin 

Ser Pro 
370 
Ser Gin 
385 

He Pro 

Pro Met 

Pro Pro 

Tyr Pro 
450 



Arg Arg Cys 
245 

Lys Ala Asp 

260 
Lys Asn Gly 
275 

He Gin Met 

Leu Tyr Leu 

He Lys Glu 
325 

Glu Thr Tyr 

340 
Lys Gin Thr 
355 

Pro Leu Asn 

Leu He Asn 

Asp Gly Met 
405 

Ala Gly Asp 

420 
Leu Ser Met 
435 

Thr Asp Cys 



Phe Glu 

Glu Asp 

Asp Gly 

Thr Ser 
295 
Pro Val 
310 

Ser Leu 

Arg Gin 

Ser He 

Lys Met 
375 
Pro Gin 
390 

Gly Ala 

Met Asn 

Pro Ser 

Ser lie 
455 



Ala Arg He 
250 

Ser He Arg 

265 
Thr Lys Arg 
280 

He Lys Lys 

Arg Gly Arg 

Glu Leu Met 
330 

Gin Gin Gin 

345 
Gin Ser Pro 
360 

Asn Ser Met 

Gin Arg Asn 

Asn He Pro 
410 

Gly Leu Ser 

425 
Thr Ser His 
440 

Val Arg He 



Cys Ala Cys Pro 
Lys Gin 
Pro Phe 



Arg Arg 
300 
Glu Thr 
315 

Gin Tyr 

Gin Gin 

Ser Ser' 

Asn Lys 
380 

Ala Leu 
395 

Met Met 

Pro Thr 

Cys Thr 

Trp Gin 
460 



Gin Val 
270 
Arg Gin 
285 

Ser Pro 



Tyr Glu 

Leu' Pro 

His Gin 
350 
Tyr Gly 
365 

Leu Pro 

Thr Pro 

Gly Thr 

Gin Ala 
430 
Pro Pro 
445 
Val 



Gly Arg 
255 

Ser Asp 

Asn Thr 

Asp Asp 

Met Leu 
320 
Gin His 
335 

His Leu 

Asn Ser 

Ser Val 

Thr Thr 
400 
His Met 
415 

Leu Pro 
Pro Pro 



<210> 344 
<211> 516 
<212> PRT 

<213> Homo sapiens 



<400> 344 



Met 


Ser 


Gin 


Ser 


Thr Gin Thr Asn Glu Phe Leu Ser Pro Glu 


Val 


Phe 


1 








5 10 


15 




Gin 


His 


He 


Trp 
20 


Asp Phe Leu Glu Gin Pro He Cys Ser Val 
25 30 


Gin 


Pro 


He 


Asp 


Leu 


Asn 


Phe Val Asp Glu Pro Ser Glu Asp Gly Ala 


Thr 


Asn 




35 




40 45 






Lys 


He 


Glu 


He 


Ser Met Asp Cys He Arg Met Gin Asp Ser 


Asp 


Leu 


50 






55 " ~ 60 






Ser 


Asp 


Pro 


Met 


Trp Pro Gin Tyr Thr Asn Leu Gly Leu Leu Asn 


Ser 


65 








70 75 




80 


Met 


Asp 


Gin 


Gin 


He Gin Asn Gly Ser Ser Ser Thr Ser Pro 


Tyr Asn 








85 90 


95 




Thr 


Asp 


His 


Ala 


Gin Asn Ser Val Thr Ala Pro Ser Pro Tyr 


Ala 


Gin 






100 


105 110 






Pro 


Ser 


Ser 
115 


Thr 


Phe Asp Ala Leu Ser Pro Ser Pro Ala He 
120 125 


Pro 


Ser 


Asn 


Thr 
130 


Asp 


Tyr 


Pro Gly Pro His Ser Phe Asp Val Ser Phe 
135 140 


Gin 


Gin 


Ser 


Ser 


Thr 


Ala 


Lys Ser Ala Thr Trp Thr Tyr Ser Thr Glu Leu Lys 


145 








150 155 




160 


Lys 


Leu 


Tyr 


Cys 


Gin He Ala Lys Thr Cys Pro He Gin He 
165 170 


Lys 
175 


Val 
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152 



Met 


Thr 


Pro 


Pro 


Pro 


Gin Gly Ala Val 


He 


Arg 


Ala 


Met 


Pro 


Val 


Tyr 








180 








185 










190 






Lys 


Lys 


Ala 


Glu 


His 


Val 


Thr Glu 


Val 


Val 


Lys 


Arg 


Cys 


Pro 


Asn 


His 






195 








200 










205 








Glu 


Leu 


Ser 


Arg 


Glu 


Phe Asn Glu Gly 


Gin 


He 


Ala 


Pro 


Pro 


Ser 


His 




210 










215 








220 










Leu 


He 


Arg 


Val 


Glu 


Gly Asn Ser His 


Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 


225 










230 








235 










240 


He 


Thr 


Gly 


Arg 


Gin 


Ser 


Val Leu 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val 










245 








250 










255 




Gly 


Thr 


Glu 


Phe 


Thr 


Thr 


Val Leu 


Tyt 


Asn 


Phe 


Met 


Cys 


Asn 


Ser 


Ser 








260 








265 










270 






Cys 


Val 


Gly 


Gly 


Met 


Asn Arg Arg 


Pro 


He 


Leu 


He 


He 


Val 


Thr 


Leu 






275 








280 










285 








Glu 


Thr 


Arg 


Asp 


Gly 


Gin 


Val Leu 


Gly 


Arg 


Arg 


Cys 


Phe 


Glu 


Ala 


Arg 




290 










295 








300 










He 


Cys 


Ala 


Cys 


Pro 


Gly Arg Asp Arg 


Lys 


Ala 


Asp 


Glu 


Asp 


Ser 


He 


305 










310 








315 










320 


Arg 


Lys 


Gin 


Gin 


Val 


Ser Asp Ser Thr 


Lys 


Asn 


Gly 


Asp 


Gly 


Thr 


Lys 










325 








330 










335 




Arg 


Pro 


Phe 


Arg 


Gin 


Asn Thr His Gly 


He 


Gin 


Met 


Thr 


Ser 


He 


Lys 








340 








345 










350 






Lys 


Arg 


Arg 


Ser 


Pro 


Asp Asp Glu Leu 


Leu 


Tyr 


Leu 


Pro 


Val 


Arg 


Gly 






355 








360 










365 








Arg 


Glu 


Thr 


Tyr 


Glu 


Met 


Leu Leu Lys 


He 


Lys 


Glu 


Ser 


Leu 


Glu 


Leu 




370 










375 








380 










Met 


Gin 


Tyr 


Leu 


Pro 


Gin 


His Thr 


He 


Glu 


Thr' 


Tyr 


Arg 


Gin 


Gin 


Gin 


385 










390 








395 










400 


Gin 


Gin 


Gin 


His 


Gin 


His 


Leu Leu 


Gin 


Lys 


Gin 


Thr 


Ser 


He 


Gin 


Ser 










405 








410 










415 




Pro 


Ser 


Ser 


Tyr 


Gly 


Asn 


Ser Ser 


Pro 


Pro 


Leu 


Asn 


Lys 


Met 


Asn 


Ser 
















425 










A *3fi 

4JU 






Met 


Asn 


Lys 


Leu 


Pro 


Ser 


Val Ser 


Gin 


Leu 


He 


Asn 


Pro 


Gin 


Gin 


Arg 






435 








440 










445 








Asn 


Ala 


Leu 


Thr 


Pro 


Thr 


Thr He 


Pro 


Asp 


Gly 


Met 


Gly 


Ala 


Asn 


lie 




450 










455 








460 










Pro 


Met 


Met 


Gly 


Thr 


His 


Met Pro 


Met 


Ala 


Gly 


Asp 


Met 


Asn 


Gly 


Leu 


465 










470 








475 










480 


Ser 


Pro 


Thr 


Gin 


Ala 


Leu 


Pro Pro 


Pro 


Leu 


Ser 


Met 


Pro 


Ser 


Thr 


Ser 










485 








490 










495 




His 


Cys 


Thr 


Pro 


Pro 


Pro 


Pro Tyr 


Pro 


Thr 


Asp 


Cys 


Ser 


He 


Val 


Arg 








500 








505 










510 






He 


Trp 


Gin 


Val 

























515 



<210> 345 

<211> 1800 

<212> DNA 

<213> Homo sapiens 

<400> 345 

gcgcctcatt gccactgcag tgactaaagc 
actggttgtt ttttaaacaa . attctgatac 
tgacattcgt atcatcactg tgcaccattg 
ggaggtctga aaccctcgca gagggatctt 
agtcgttgga aacaggactc agggataaac 
tttcatcggg ggtgtcaaca aacactccac 



tgggaagacg ctggtcagtt cacctgcccc 60 
aggcgacatc ctcactgacc gagcaaagat 120 
gcttctaggc actccagtgg ggtaggagaa 180 
gccctcattc tttgggtctg aaacactggc 240 
cagcgcaatg gattggggga cgctgcacac 300 
cagcatcggg aaggtgtgga tcacagtcat 360 
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ctttattttc cgagtcatga tcctagtggt 
agaggacttc gtctgcaaca cactgcaacc 
tttcccggtg tcccacatcc ggctgtgggc 
gctgctggtg gccatgcatg tggcctacta 
aggagagaag aggaatgatt tcaaagacat 
agaggggtcg ctgtggtgga cgtacaccag 
agcctttatg tatgtgtttt acttccttta 
atgtgggatt gacccctgcc ccaaccttgt 
gaccgtgttt accattttta tgatttctgc 
agagttgtgc tacctgctgc tgaaagtgtg 
aaaaaatcac cccaatcatg ccctaaagga 
ttcagatagt ggtcaaaatg caatcacagg 
gtagctgcgt cataaggaga cttctgtctt 
cttctgtagc ctgaagagtt tgtaaatgac 
ttttgtagga tacttgctcc attcatacac 
aaacaagaga ctgcttgaca aaggagcatt 
gactctctga caaagtgggt actttctgaa 
tttatccagg aattgatacg tttattagga 
agttccgact ttgaatttat ataaagtatt 
aaacatgcga tgttagtttt agaattacac 
ggtcctatct tgtaaatatt gttttgcatt 
tacgcttaag gtgggaaagt gttcattgca 
acggaacagt gtggaagcag aaggcttttt 
ccactgggag atgtggatgt ggttgcctcc 



ggctgcccag gaagtgtggg gtgacgagca 420 
gggatgcaaa aatgtgtgct atgaccactt 480 
cctccagctg atcttcgtct ccaccccagc 540 
caggcacgaa accactcgca agttcaggcg 600 
agaggacatt aaaaagcaca aggttcggat 660 
cagcatcttt ttccgaatca tctttgaagc 720 
caatgggtac cacctgccct gggtgttgaa 780 
tgactgcttt atttctaggc caacagagaa 840 
gtctgtgatt tgcatgctgc ttaacgtggc 900 
ttttaggaga tcaaagagag cacagacgca 960 
gagtaagcag aatgaaatga atgagctgat 1020 
tttcccaagd taaacatttc aaggtaaaat 1080 
ctccagaagg caataccaac ctgaaagttc 1140 
tttcataata aatagacact tgagttaact 1200 
aacgtaatca aatatgtggt ccatctctga 1260 
gcagtcactt tgacaggttc cttttaagtg 1320 
aatttatata actgttgttg ataaggaaca 1380 
aaagatattt ttataggctt ggatgttttt 1440 
tttataatga ctggtcttcc ttacctggaa 1500 
cacaagtatc taaatttcca acttacaaag 1560 
gtctgttggc aaatttgtga actgtcatga 1620 
caatatattt ttactgcttt ctgaatgtag 1680 
taactcatcc gtttggccga tcgttgcaga 1740 
ttttgctcgt ccccgtggct taacccttct 1800 



<210> 346 
<211> 261 
<212> PRT 

<213> Homo sapiens 



<400> 346 



Met 


Asp 


Trp 


Gly 


Thr Leu His 


Thr 


1 








5 




Ser 


Thr 


Ser 


He 


Gly Lys Val 


Trp 








20 






Val 


Met 


He 


Leu 


Val Val Ala 


Ala 






35 






40 


Glu 


Asp 


Phe 


Val 


Cys Asn Thr 


Leu 




50 






55 




Tyr 


Asp 


His 


Phe 


Phe Pro Val 


Ser 


65 








70 




Leu 


He 


Phe 


Val 


Ser Thr Pro 


Ala 










85 




Tyr 


Tyr 


Arg 


His 


Glu Thr Thr 


Arg 








100 






Asn 


Asp 


Phe 


Lys 


Asp He Glu 


Asp 






115 






120 


Glu 


Gly 


Ser 


Leu 


Trp Trp Thr 


Tyr 




130 






135 




lie 


Phe 


Glu 


Ala 


Ala Phe Met 


Tyr 


145 








150 




Tyr 


His 


Leu 


Pro 


Trp Val Leu 


Lys 










165 




Leu 


Val 


Asp 


Cys 


Phe He Ser 


Arg 








180 






He 


Phe 


Met 


He 


Ser Ala Ser 


Val 






195 






200 



Phe 


He 


Gly 


Gly 


Val 


Asn 


Lys His 




10 










15 


He 


Thr 


Val 


He 


Phe 


He 


Phe Arg 


25 










30 




Gin 


Glu 


Val 


Trp 


Gly 


Asp 


Glu Gin 










45 






Gin 


Pro 


Gly 


Cys 


Lys 


Asn 


Val Cys 








60 








His 


He 


Arg 


Leu 


Trp 


Ala 


Leu Gin 






75 








80 


Leu 


Leu 


Val 


Ala 


Met 


His 


Val Ala 




90 










95 


Lys 


Phe 


Arg 


Arg 


Gly 


Glu 


Lys Arg 


105 










110 




He 


Lys 


Lys 


His 


Lys 


Val 


Arg He 










125 






Thr 


Ser 


Ser 


He 


Phe 


Phe 


Arg lie 








140 








Val 


Phe 


Tyr 


Phe 


Leu 


Tyr 


Asn Gly 






155 








160 


Cys 


Gly 


He 


Asp 


Pro 


Cys 


Pro - Asn 




170 










175 


Pro 


Thr 


Glu 


Lys 


Thr 


Val 


Phe Thr 


185 










190 




He 


Cys 


Met 


Leu 


Leu 


Asn 


Val Ala 



205 
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Glu Leu Cys Tyr Leu Leu Leu Lys Val Cys Phe Arg Arg Ser Lys Arg 

210 215 220 

Ala Gin Thr Gin Lys Asn His Pro Asn His Ala Leu Lys Glu Ser Lys 
225 230 235 . . 240 

Gin Asn Glu Met Asn Glu Leu lie Ser Asp Ser Gly Gin Asn Ala lie 

245 250 " 255 

Thr Gly Phe Pro Ser 
260 



<210> 347 

<211> 1740 

<212> DNA 

<213> Homo sapiens 

<400> 347 

atgaacaaac tgtatatcgg aaacctcagc gagaacgccg ccccctcgga cctagaaagt 60 

atcttcaagg acgccaagat cccggtgtcg ggacccttcc tggtgaagac tggctacgcg 120 

ttcgtggact gcccggacga gagctgggcc ctcaaggcca tcgaggcgct ttcaggtaaa 180 

atagaactgc acgggaaacc catagaagtt gagcactcgg tcccaaaaag gcaaaggatt 240 

cggaaacttc agatacgaaa tatcccgcct catttacagt gggaggtgct ggatagttta 300 

ctagtccagt atggagtggt ggagagctgt gagcaagtga acactgactc ggaaactgca 360 

gttgtaaatg taacctattc cagtaaggac caagctagac aagcactaga caaactgaat 420 

ggatttcagt tagagaattt caccttgaaa gtagcctata tccctgatga aacggccgcc 480 

cagcaaaacc ccttgcagca gccccgaggt cgccgggggc ttgggcagag gggctcctca 540 

aggcaggggt ctccaggatc cgtatccaag cagaaaccat gtgatttgcc tctgcgcctg 600 

ctggttccca cccaatttgt tggagccatc ataggaaaag aaggtgccac cattcggaac 660 

atcaccaaac agacccagtc taaaatcgat gtccaccgta aagaaaatgc gggggctgct 720 

gagaagtcga ttactatcct ctctactcct gaaggcacct ctgcggcttg taagtctatt 780 

ctggagatta tgcataagga agctcaagat ataaaattca cagaagagat ccccttgaag 840 

attttagctc ataataactt tgttggacgt cttattggta aagaaggaag aaatcttaaa 900 

aaaattgagc aagacacaga cactaaaatc acgatatctc cattgcagga attgacgctg 960 

tataatccag aacgcactat tacagttaaa ggcaatgttg agacatgtgc caaagctgag 1020 

gaggagatca tgaagaaaat cagggagtct tatgaaaatg atattgcttc tatgaatctt 1080 

caagcacatt taattcctgg attaaatctg aacgccttgg gtctgttccc acccacttca 1140 

gggatgccac ctcccacctc agggccccct tcagccatga ctcctcccta cccgcagttt 1200 

gagcaatcag aaacggagac tgttcatctg tttatcccag ctctatcagt cggtgccatc 1260 

atcggcaagc agggccagca catcaagcag ctttctcgct ttgctggagc ttcaattaag 1320 

attgctccag cggaagcacc agatgctaaa gtgaggatgg tgattatcac tggaccacca 1380 

gaggctcagt tcaaggctca gggaagaatt tatggaaaaa ttaaagaaga aaactttgtt 14 40 

agtcctaaag aagaggtgaa acttgaagct catatcagag tgccatcctt tgctgctggc 1500 

agagttattg gaaaaggagg caaaacggtg aatgaacttc agaatttgtc aagtgcagaa 1560 

gttgttgtcc ctcgtgacca gacacctgat gagaatgacc aagtggttgt caaaataact 1620 

ggtcacttct atgcttgcca ggttgcccag agaaaaattc aggaaattct gactcaggta 1680 

aagcagcacc aacaacagaa ggctctgcaa agtggaccac ctcagtcaag acggaagtaa 1740 



<210> 348 
<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 348 

Met Asn Lys Leu Tyr lie Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 

1 5 10 15 

Asp Leu Glu Ser lie Phe Lys Asp Ala Lys lie Pro Val Ser Gly Pro 

20 '25 " 30 

Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser 
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35 

Trp Ala Leu Lys Ala 
50 

Gly Lys Pro lie Glu 
65 

Arg Lys Leu Gin lie 
85 

Leu Asp Ser Leu Leu 
100 

Val Asn Thr Asp Ser 
115 

Lys Asp Gin Ala Arg 
130 

Glu Asn Phe Thr Leu 
145 

Gin Gin Asn Pro Leu 
165 

Arg Gly Ser Ser Arg 
180 

Pro Cys Asp Leu Pro 
195 

Ala He He Gly Lys 
210 

Thr Gin Ser Lys He 
225 

Glu Lys Ser He Thr 
245 

Cys Lys Ser He Leu 
260 

Phe Thr Glu Glu He 
275 

Gly Arg Leu He Gly 
290 

Asp Thr Asp Thr Lys 
305 

Tyr Asn Pro Glu Arg 
325 

Ala Lys Ala Glu Glu 
340 

Asn Asp He Ala Ser 
355 

Asn Leu Asn Ala Leu 
370 

Pro Thr Ser Gly Pro 
385 

Glu Gin Ser Glu Thr 
405 

Val Gly Ala He He 
420 

Arg Phe Ala Gly Ala 
435 

Ala Lys Val Arg Met 
450 

Lys Ala Gin Gly Arg 
465 

Ser Pro Lys Glu Glu 
485 

Phe Ala Ala Gly Arg 



40 

He Glu Ala Leu Ser 
55 

Val Glu His Ser Val 
70 

Arg Asn He Pro Pro 
90 

Val Gin Tyr Gly Val 
105 

Glu Thr Ala Val Val 
120 

Gin Ala Leu Asp Lys 
135 

Lys Val Ala Tyr He 
150 

Gin Gin Pro Arg Gly 
170 

Gin Gly Ser Pro Gly 
185 

Leu Arg Leu Leu Val 
200 

Glu Gly Ala Thr He 
215 

Asp Val His Arg" Lys 
230 

He Leu Ser Thr Pro 
250 

Glu He Met His Lys 
265 

Pro Leu Lys He Leu 
280 

Lys Glu Gly Arg Asn 
295 " I 

He Thr He Ser Pro 
310 

Thr He Thr Val Lys 
330 

Glu He Met Lys Lys 
345 

Met Asn Leu Gin Ala 
360 

Gly Leu Phe Pro Pro 
375 

Pro Ser Ala Met Thr 
390 

Glu Thr Val His Leu 
410 

Gly Lys Gin Gly Gin 
425 

Ser lie Lys lie Ala 
440 

Val He He Thr Gly 
455 

He Tyr Gly Lys lie 
470 

Val Lys Leu Glu Ala 
490 

Val lie Gly Lys Gly 



45 

Gly Lys He Glu Leu His 
60 

Pro Lys Arg Gin Arg lie 
75 80 
His Leu Gin Trp Glu Val 
95 

Val Glu Ser Cys Glu Gin 
110 

Asn Val Thr Tyr Ser Ser 
125 

Leu Asn Gly Phe Gin Leu 
140 

Pro Asp Glu Thr Ala Ala 
155 160 
Arg Arg Gly Leu Gly Gin 
175 

Ser Val Ser Lys Gin Lys 
190 

Pro Thr Gin Phe Val Gly 
205 

Arg Asn lie Thr. Lys Gin 
220 

Glu Asn Ala Gly Ala Ala 
235 240 
Glu Gly Thr Ser Ala Ala 

255* 

Glu Ala Gin Asp lie Lys 
270 

Ala His Asn Asn Phe Val 
285 

Leu Lys Lys lie Glu Gin 
300 

Leu Gin Glu Leu Thr Leu 
315 320 
Gly Asn Val Glu Thr Cys 
335 

lie Arg Glu Ser Tyr Glu 
350 

His Leu lie Pro Gly Leu 
365 

Thr Ser Gly Met Pro Pro 
380 

Pro Pro Tyr Pro Gin Phe 
395 400 
Phe He Pro Ala Leu Ser 
415 

His lie Lys Gin Leu Ser 
430 

Pro Ala Glu Ala Pro Asp 
445 

Pro Pro Glu Ala Gin Phe 
460 

Lys Glu Glu Asn Phe Val 
475 480 
His lie Arg Val Pro Ser 
495 

Gly Lys Thr Val Asn Glu 
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500 






Leu Gin Asn 


Leu 


Ser 


Ser Ala Glu 


515 






520 


Pro Asp Glu 


Asn Asp 


Gin Val Val 


530 






535 


Ala Cys Gin 


Val 


Ala 


Gin Arg Lys 


545 






550 


Lys Gin His 


Gin 


Gin 
565 


Gin Lys Ala 


Arg Arg Lys 









505 






510 






Val 


Val 


Val 


Pro Arg Asp 


Gin 


Thr 








525 






Val 


Lys 


He 


Thr Gly His 


Phe 


Tyr 








540 






He 


Gin 


Glu 


He Leu Thr 


Gin 


Val 






555 






560 


Leu 


Gin 


Ser 


Gly Pro Pro 


Gin 


Ser 




570 






575 





<210> 349 
<211> 207 
<212> DNA 

<213> Homo sapiens 
<400> 349 

atgtggcagc ccctcttctt caagtggctc ttgtcctgtt gccctgggag ttctcaaatt 60 
gctgcagcag cctccaccca gcctgaggat gacatcaata cacagaggaa gaagagtcag 120 
gaaaagatga gagaagttac agactctcct gggcgacccc gagagcttac cattcctcag 180 
acttcttcac atggtgctaa cagattt 207 

<210> 350 
<211> 69 
<212> PRT 

<213> Homo sapiens 
<400> 350 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly 

15 10 15 

Ser Ser Gin He Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp He 

20 25 30 

Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp 

35 40 45 

Ser Pro Gly Arg Pro Arg Glu Leu Thr He Pro Gin Thr Ser Ser His 

50 55 60 

Gly Ala Asn Arg Phe 
65 



<210> 351 
<211> 1012 
<212> DNA 

<213> Homo sapiens 
<400> 351 

ccctctagaa ataattttgt ttaactttaa 
catcacacgg ccgcgtccga taacttccag 
ccgatcgggc aggcgatggc gatcgcgggc 
cctaccgcct tcctcggctt gggtgttgtc 
cgcgtggtcg .ggagcgctcc ggcggcaagt 
gcggtcgacg gcgctccgat caactcggcc 
catcccggtg acgtcatctc ggtgacctgg 
aacgtgacat tggccgaggg acccccggcc 
ttcatcgggg gtgtcaacaa acactccacc 
tttattttcc gagtcatgat cctcgtggtg 



gaaggagata tacatatgca tcaccatcac 60 

ctgtcccagg gtgggcaggg attcgccatt 120 

cagatcaagc ttcccaccgt tcatatcggg 180 

gacaacaacg gcaacggcgc acgagtccaa 240 

ctcggcatct ccaccggcga cgtgatcacc 300 

accgcgatgg cggacgcgct taacgggcat 360 

caaaccaagt cgggcggcac gcgtacaggg 420 

gaattcatgg attgggggac gctgcacact 480 

agcatcggga aggtgtggat cacagtcatc 540 

gctgcccagg aagtgtgggg tgacgagcaa 600 
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gaggacttcg tctgcaacac actgcaaccg ggatgcaaaa atgtgtgcta tgaccacttt 660 

ttcccggtgt cccacatccg gctgtgggcc ctccagctga tcttcgtctc caccccagcg 720 

ctgctggtgg ccatgcatgt ggcctactac aggcacgaaa ccactcgcaa gttcaggcga 780 

ggagagaaga ggaatgattt caaagacata gaggacatta aaaagcagaa ggttcggata 84 0 

gaggggtgac tcgagcacca ccaccaccac cactgagatc cggctgctaa caaagcccga 900 

aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc ccttggggcc 960 

tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg at 101: 

<210> 352 

<211> 267 

<212> PRT 

<213> Homo sapiens 



<400> 352 



Met 


His 


His 


His 


His His 


His Thr Ala Ala Ser Asp Asn 


Phe 


Gin 


Leu 


1 








5 


10 




15 




Ser 


Gin 


Gly 


Gly 


Gin Gly 


Phe Ala He Pro He Gly Gin 


Ala 


Met 


Ala 






20 




25 


30 






He 


Ala 


Gly 
35 


Gin 


He Lys 


Leu Pro Thr Val His He Gly 
40 45 


Pro 


Thr 


Ala 


Phe 


Leu 


Gly 


Leu 


Gly Val 


Val Asp Asn Asn Gly Asn Gly 


Ala Arg Val 




50 






55 60 








Gin 


Arg 


Val 


Val 


Gly Ser 


Ala Pro Ala Ala Ser Leu Gly 


He 


Ser 


Thr 


65 






70 


75 






80 


Gly 


Asp 


Val 


He 


Thr Ala 


Val Asp Gly Ala Pro He Asn 


Ser 


Ala 


Thr 






85 


90 




95 




Ala 


Met 


Ala 


Asp 
100 


Ala Leu 


Asn Gly His His Pro Gly Asp 
105 


Val 
110 


He 


Ser 


Val 


Thr 


Trp 
115 


Gin 


Thr Lys 


Ser Gly Gly Thr Arg Thr Gly 
120 125 


Asn 


Val 


Thr 


Leu 


Ala 


Glu 


Gly 


Pro Pro 


Ala Glu Phe Met Asp Trp Gly 


Thr 


Leu 


His 




130 






135 140 








Thr 


Phe 


He 


Gly 


Gly Val 


Asn Lys His Ser Thr Ser He 


Gly Lys Val 


145 








150 


155 






160 


Trp 


He 


Thr 


Val 


He Phe 


He Phe Arg Val Met He Leu 


Val 


Val 


Ala 








165 


170 




175 




Ala 


Gin 


Glu 


Val 


Trp Gly 


Asp Glu Gin Glu Asp Phe Val 


Cys Asn Thr 








180 




185 


190 






Leu 


Gin 


Pro 


Gly 


Cys Lys 


Asn Val Cys Tyr Asp His Phe 


Phe 


Pro 


Val 






195 




200 205 








Ser 


His 
210 


He 


Arg 


Leu Trp 


Ala Leu Gin Leu He Phe Val 
215 220 


Ser 


Thr 


Pro 


Ala 


Leu 


Leu 


Val 


Ala Met 


His Val Ala Tyr Tyr Arg His 


Glu 


Thr 


Thr 


225 








230 


235 






240 


Arg 


Lys 


Phe 


Arg 


Arg Gly 


Glu Lys Arg Asn Asp Phe Lys 


Asp 


He 


Glu 








245 


250 




255 




Asp 


He 


Lys 


Lys 
260 


Gin Lys 


Val Arg lie Glu Gly 
265 









<210> 353 
<211> 900 
<212> DNA 

<213> Homo sapiens 
<400> 353 

atgcatcacc atcaccatca cacggccgcg tccgataact tccagctgtc ccagggtggg 60 
cagggattcg ccattccgat cgggcaggcg atggcgatcg cgggccagat caagcttccc 120 
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accgttcata tcgggcctac cgccttcctc 
ggcgcacgag tccaacgcgt ggtcgggagc 
ggcgacgtga tcaccgcggt cgacggcgct 
gcgcttaacg ggcatcatcc cggtgacgtc 
ggcacgcgta cagggaacgt gacattggcc 
actcgcaagt tcaggcgagg agagaagagg 
aagcagaagg ttcggataga ggggtcgctg 
cgaatcatct ttgaagcagc ctttatgtat 
ctgccctggg tgttgaaatg tgggattgac 
tctaggccaa cagagaagac cgtgtttacc 
atgctgctta acgtggcaga gttgtgctac 
aagagagcac agacgcaaaa aaatcacccc 
gaaatgaatg agctgatttc agatagtggt 



ggcttgggtg ttgtcgacaa caacggcaac 18 0 
gctccggcgg caagtctcgg catctccacc 240 
ccgatcaact cggccaccgc gatggcggac 300 
atctcggtga cctggcaaac caagtcgggc 360 
gagggacccc cggccgaatt ccacgaaacc 420 
aatgatttca aagacataga ggacattaaa 480 
tggtggacgt acaccagcag catctttttc 540 
gtgttttact tcctttacaa tgggtaccac 600 
ccctgcccca accttgttga ctgctttatt 660 
atttttatga tttctgcgtc tgtgatttgc 720 
ctgctgctga aagtgtgttt taggagatca 780 
aatcatgccc taaaggagag taagcagaat 840 
caaaatgcaa tcacaggttt cccaagctaa 900 



<210> 354 
<211> 299 
<212> PRT 

<213> Homo sapiens 



<400> 354 



Met 


His 


His 


His 


His 


His 


His 


Thr 


1 








5 








Ser 


Gin 


Gly 


Gly 


Gin 


Gly 


Phe 


Ala 








20 










lie 


Ala 


Gly 


Gin 


He 


Lys 


Leu 


Pro 






35 










40 


Phe 


Leu 


Gly 


Leu 


Gly 


Val 


Val 


Asp 




50 










55 




Gin 


Arg 


Val 


Val 


Gly 


Ser 


Ala 


Pro 


65 










70 






Gly 


Asp 


Val 


He 


Thr 


Ala 


Val 


Asp 










85 








Ala 


Met 


Ala 


Asp 


Ala 


Leu 


Asn 


Gly 








100 










Val 


Thr 


Trp 


Gin 


Thr 


Lys 


Ser 


Gly 






115 










120 


Leu 


Ala 


Glu 


Gly 


Pro 


Pro 


Ala 


Glu 




130 










135 




Arg 


Arg 


Gly 


Glu 


Lys 


Arg 


Asn 


Asp 


145 










150 






Lys 


Gin 


Lys 


Val 


Arg 


He 


Glu 


Gly 










165 








Ser 


He 


Phe 


Phe 


Arg 


He 


He 


Phe 








180 








Tyr 


Phe 


Leu 


Tyr 


Asn 


Gly 


Tyr 


His 






195 










200 


He 


Asp 


Pro 


Cys 


Pro 


Asn 


Leu 


Val 




210 










215 




Glu 


Lys 


Thr 


Val 


Phe 


Thr 


He 


Phe 


225 










230 






Met 


Leu 


Leu 


Asn 


Val 


Ala 


Glu 


Leu 










245 








Phe 


Arg 


Arg 


Ser 


Lys 


Arg 


Ala 


Gin 








260 










Ala 


Leu 


Lys 


Glu 


Ser 


Lys 


Gin 


Asn 






275 










280 


Ser 


Gly 


Gin 


Asn 


Ala 


He 


Thr 


Gly 



Ala 


Ala 


Ser 


Asp 


Asn 


Pne 


Gin 


Leu 




10 










15 




He 


Pro 


He 


Gly 


Gin 


Ala 


Met 


Ala 


25 










30 






Thr 


Val 


His 


He 


Gly 


Pro 


Thr 


Ala 










45 








Asn 


Asn 


Gly 


Asn 


Gly 


Ala 


Arg 


Val 








60 










Ala 


Ala 


Ser 


Leu 


Gly 


He 


Ser 


Thr 






75 










80 


Gly 


Ala 


Pro 


He 


Asn 


Ser 


Ala 


Thr 




90 










95 




His 


His 


Pro 


Gly 


Asp 


Val 


He 


Ser 


105 










110 






Gly 


Thr 


Arg 


Thr 


Gly 


Asn 


Val 


Thr 










125 








Phe 


His 


Glu 


Thr 


Thr 


Arg 


Lys 


Phe 








140 










Phe 


Lys 


Asp 


He 


Glu 


Asp 


He 


Lys 






155 










160 


Ser 


Leu 


Trp 


Trp 


Thr 


Tyr 


Thr 


Ser 




170 










175 




Glu 


Ala 


Ala 


Phe 


Met 


Tyr 


Val 


Phe 


185 










190 






Leu 


Pro 


Trp 


Val 


Leu 


Lys 


Cys 


Gly 










205 








Asp 


Cys 


Phe 


He 


Ser 


Arg 


Pro 


Thr 








220 










Met 


He 


Ser 


Ala 


Ser 


Val 


He 


Cys 






235 










240 


Cys 


Tyr 


Leu 


Leu 


Leu 


Lys 


val 


Cys 




250 










255 




Thr 


Gin 


Lys 


Asn 


His 


Pro 


Asn 


His 


265 










270 






Glu 


Met 


Asn 


Glu 


Leu 


He 


Ser 


Asp 



285 



Phe Pro Ser 
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<210> 355 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 355 

ggagtacagc ttcaagacaa tggg 24 

<210> 356 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 356 

ccatgggaat tcattataat aattttgttc c 31 

<210> 357 
<211> 920 
<212> PRT 

<213> Homo sapiens 



<400> 357 



Met 


Gin 


His 


His 


His 


His 


His 


His 


Gly Val Gin Leu Gin Asp Asn Gly 


1 








5 








10 " 15 


Tyr 


Asn 


Gly 


Leu 


Leu 


He 


Ala 


He 


Asn Pro Gin Val Pro Glu Asn Gin 








20 










25 30 


Asn 


Leu 


He 


Ser 


Asn 


He 


Lys 


Glu 


Met He Thr Glu Ala Ser Phe Tyr 






35 










40 


45 


Leu 


Phe 


Asn 


Ala 


Thr 


Lys 


Arg 


Arg 


Val Phe Phe Arg Asn He Lys He 




50 










55 




60 


Leu 


He 


Pro 


Ala 


Thr 


Trp 


Lys 


Ala 


Asn Asn Asn Ser Lys lie Lys Gin 


65 










70 






75 80 


Glu 


Ser 


Tyr 


Glu 


Lys 


Ala 


Asn 


Val 


He Val Thr Asp Trp Tyr Gly Ala 










85 








90 95 


His 


Gly 


Asp 


Asp 


Pro 


Tyr 


Thr 


Leu 


Gin Tyr Arg Gly Cys Gly Lys Glu 








100 










105 110 


Gly 


Lys 


Tyr 


He 


His 


Phe 


Thr 


Pro 


Asn Phe Leu Leu Asn Asp Asn Leu 






115 










120 


125 


Thr 


Ala 


Gly 


Tyr 


Gly 


Ser 


Arg 


Gly 


Arg Val Phe Val His Glu Trp Ala 




130 










135 




140 


His 


Leu 


Arg 


Trp 


Gly 


Val 


Phe 


Asp 


Glu Tyr Asn Asn Asp Lys Pro Phe 


145 










150 






155 " 160 


Tyr 


He 


Asn 


Gly 


Gin 


Asn 


Gin 


He 


Lys Val Thr Arg Cys Ser Ser Asp 










165 








170 ~ 175 


lie 


Thr 


Gly 


He 


Phe 


Val 


Cys 


Glu 


Lys Gly -Pro Cys Pro Gin Glu Asn 








180 










185 190 


Cys 


He 


He 


Ser 


Lys 


Leu 


Phe 


Lys 


Glu Gly Cys Thr Phe He Tyr Asn 






195 










200 


205 


Ser 


Thr 


Gin 


Asn 


Ala 


Thr 


Ala 


Ser 


He Met Phe Met Gin Ser Leu Ser 
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210 




215 




Ser 


Val 


Val 


Glu Phe Cys Asn 


Ala 


225 






230 




Asn 


Leu 


Gin 


Asn Gin Met Cys 


Ser 








245 




Thr 


Asp 


Ser 


Ala Asp Phe His 


His 








260 




Leu 


Pro 


Pro 


Pro Pro Thr Phe 


Ser 






275 




280 


Val 


Cys 


Leu 


Val Leu Asp Val 


Ser 




290 




295 




Leu 


Leu 


Gin 


Leu Gin Gin Ala 


Ala 


305 






310 




Glu 


He 


His 


Thr Phe Val Gly 


lie 








325 




lie 


Arg 


Ala 


Gin Leu His Gin 


lie 








340 




Leu 


Val 


Ser 


Tyr Leu Pro Thr 


Thr 






355 




360 


He 


Cys 


Ser 


Gly Leu Lys Lys 


Gly 




370 




375 




Gly 


Lys 


Ala 


Tyr Gly Ser Val 


Met 


385 






390 




Lys 


Leu 


Leu 


Gly Asn Cys Leu 


Pro 








405 




He 


His 


Ser 


He Ala Leu Gly 


Ser 








420 




Leu 


Ser 


Arg 


Leu Thr Gly Gly 


Leu 






435 




440 


Asn 


Ser 


Asn 


Ser Met lie Asp 


Ala 




450 




455 




Gly 


Asp 


lie 


Phe Gin Gin His 


lie 


4 65 






470 




Val 


Lys 


Pro 


His His Gin Leu 


Lys 








485 




Val 


Gly 


Asn 


Asp Thr Met Phe 


Leu 








500 




Pro 


Glu 


He 


lie Leu Phe Asp 


Pro 






515 




520 


Asn 


Phe 


lie 


Thr Asn Leu Thr 


Phe 




530 




535 




Gly 


Thr 


Ala 


Lys Pro Gly His 


Trp 


545 






550 




His 


Ser 


Leu 


Gin Ala Leu Lys 


Val 








565 




Ser 


Ala 


Val 


Pro Pro Ala Thr 


Val 








580 




Leu 


His 


Phe 


Pro His Pro Val 


Met 






595 




600 


Phe 


Tyr 


Pro 


He Leu Asn Ala 


Thr 




610 




615 




Thr 


Gly 


Asp 


Pro Val Thr Leu 


Arg 


625 






630 




Asp 


Val 


lie 


Lys Asn Asp Gly 


lie 








645 




Ala 


Ala 


Asn 


Gly Arg Tyr Ser 


Leu 








660 




Ser 


He 


Ser 


Thr Pro Ala His 


Ser 



220 



Ser 


Thr 


His 


Asn 


Gin 


Glu 


Ala 


Pro 






235 










240 


Leu 


Arg 


Ser 


Ala 


Trp 


Asp 


Val 


lie 




250 










255 




Ser 


Phe 


Pro 


Met 


Asn 


Gly 


Thr 


Glu 


265 










270 






Leu 


Val 


Glu 


Ala 


Gly 


Asp 


Lys 


Val 










285 








Ser 


Lys 


Met 


Ala 


Glu 


Ala 


Asp 


Arg 








300 










Glu 


Phe 


Tyr 


Leu 


Met 


Gin 


lie 


Val 






315 










320 


Ala 


Ser 


Phe 


Asp 


Ser 


Lys 


Gly 


Glu 




330 










335 




Asn 


Ser 


Asn 


Asp 


Asp 


Arg 


Lys 


Leu 


345 










350 






Val 


Ser 


Ala 


Lys 


Thr 


Asp 


He 


Ser 










365 








Phe 


Glu 


Val 


Val 


Glu 


Lys 


Leu 


Asn 








380 










He 


Leu 


Val 


Thr 


Ser 


Gly 


Asp 


Asp 






395 










400 


Thr 


Val 


Leu 


Ser 


Ser 


Gly 


Ser 


Thr 




410 










415 




Ser 


Ala 


Ala 


Pro 


Asn 


Leu 


Glu 


Glu 


425 










430 






Lys 


Phe 


Phe 


Val 


Pro 


Asp 


lie 


Ser 










445 








Phe 


Ser 


Arg 


lie 


Ser 


Ser 


Gly 


Thr 








460 










Gin 


Leu 


Glu 


Ser 


Thr 


Gly 


Glu 


Asn 






475 










480 


Asn 


Thr 


Val 


Thr 


Val 


Asp 


Asn 


Thr 




490 










495 




Val 


Thr 


Trp 


Gin 


Ala 


Ser 


Gly 


Pro 


505 










510 






Asp 


Gly 


Arg 


Lys 


Tyr 


Tyr 


Thr 


Asn 










525 








Arg 


Thr 


Ala 


Ser 


Leu 


Trp 


lie 


Pro 








540 










Thr 


Tyr 


Thr 


Leu 


Asn 


Asn 


Thr 


His 






555 










560 


Thr 


Val 


Thr 


Ser 


Arg 


Ala 


Ser 


Asn 




570 










575 




Glu 


Ala 


Pne 


val 


CaXU 


Arg 


Asp 


Ser 


585 










590 






He 


Tyr 


Ala 


Asn 


Val 


Lys 


Gin 


Gly 










605 








Val 


Thr 


Ala 


Thr 


Val 


Glu 


Pro 


Glu 








620 










Leu 


Leu 


Asp 


Asp 


Gly 


Ala 


Gly 


Ala 






635 










640 


Tyr 


Ser 


Arg 


Tyr 


Phe 


Phe 


Ser 


Phe 




650 










655 




Lys 


Val 


His 


Val 


Asn 


His 


Ser 


Pro 


665 










670 






lie 


Pro 


Gly 


Ser 


His 


Ala 


Met 


Tyr 
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675 




680 


Val 


Pro 


Gly 


Tyr 


Thr Ala Asn Gly 




690 






695 


Lys 


Ser 


Val 


Gly 


Arg Asn Glu Glu 


705 








710 


Val 


Ser 


Ser 


Gly 


Gly Ser Phe Ser 










725 


His 


Pro Asp 


Val 


Phe Pro Pro Cys 








740 




Lys 


Val 


Glu 


Glu 


Glu Leu Thr Leu 






755 




760 


Phe 


Asp Gin 


Gly 


Gin Ala Thr Ser 




770 






775 


Leu 


Gin 


Asn 


He 


Gin Asp Asp Phe 


785 








790 


Ser 


Lys Arg 


Asn 


Pro Gin Gin Ala 










805 


Ser 


Pro 


Gin 


He 


Ser Thr Asn Gly 








820 




Thr 


His 


Glu 


Ser 


His Arg He Tyr 






835 




840 


Asn 


Ser 


Leu 


Gin 


Ser Ala Val Ser 




850 






855 


lie 


Pro 


Pro 


Asn 


Ser Asp Pro Val 


865 








870 


Lys 


Gly Val 


Leu 


Thr Ala Met Gly 










885 


He 


Val 


Val 


Thr 


His His Thr Leu 








900 




Lys 


Glu 


Asn 


Gly 


Thr Lys Leu Leu 






915 




920 



685 



Asn 


He 


Gin 


Met 
700 


Asn 


Ala 


Pro 


Arg 


Glu 


Arg 


Lys 
715 


Trp 


Gly 


Phe 


Ser 


Arg 
720 


Val 


Leu 
730 


Gly 


Val 


Pro 


Ala 


Gly 
735 


Pro 


Lys 


He 


lie 


Asp 


Leu 


Glu 


Ala 


Val 


745 










750 






Ser 


Trp 


Thr 


Ala 


Pro 
765 


Gly 


Glu 


Asp 


Tyr 


Glu 


He 


Arg 
780 


Met 


Ser 


Lys 


Ser 


Asn 


Asn 


Ala 
795 


He 


Leu 


Val 


Asn 


Thr 
800 


Gly 


He 
810 


Arg 


Glu 


He 


Phe 


Thr 
815 


Phe 


Pro 


Glu 


His 


Gin 


Pro 


Asn 


Gly 


Glu 


825 










830 






Val 


Ala 


He 


Arg 


Ala 
845 


Met 


Asp 


Arg 


Asn 


He 


Ala 


Gin 
860 


Ala 


Pro 


Leu 


Phe 


Pro 


Ala 


Arg 
875 


Asp 


Tyr 


Leu 


He 


Leu 
880 


Leu 


He 
890 


Gly 


He 


He 


Cys 


Leu 
895 


He 


Ser 


Arg 


Lys 


Lys 


Arg 


Ala 


Asp 


Lys 


905 










910 







<210> 358 

<211> 2773 

<212> DNA 

<213> Homo sapiens 

<400> 358 

catatgcagc atcaccacca tcaccacgga 
ttgctcattg caattaatcc tcaggtacct 
gaaatgataa ctgaagcttc attttaccta 
agaaatataa agattttaat acctgccaca 
caagaatcat atgaaaaggc aaatgtcata 
gatccataca ccctacaata cagagggtgt 
cctaatttcc tactgaatga taacttaaca 
gtccatgaat gggcccacct ccgttggggt 
ttctacataa atgggcaaaa tcaaattaaa 
atttttgtgt gtgaaaaagg tccttgcccc 
aaagaaggat gcacctttat ctacaatagc 
atgcaaagtt tatcttctgt ggttgaattt 
ccaaacctac agaaccagat gtgcagcctc 
gctgactttc accacagctt tcccatgaac 
tcgcttgtag aggctggtga caaagtggtc 
gcagaggctg acagactcct tcaactacaa 
gttgaaattc ataccttcgt gggcattgcc 
cagctacacc aaattaacag caatgatgat 
actgtatcag ctaaaacaga catcagcatt 



gtacagcttc aagacaatgg gtataatgga 60 
gagaatcaga acctcatctc aaacattaag 120 
tttaatgcta ccaagagaag agtatttttc 180 
tggaaagcta ataataacag caaaataaaa 240 
gtgactgact ggtatggggc acatggagat 300 
ggaaaagagg gaaaatacat tcatttcaca 360 
gctggctacg gatcacgagg ccgagtgttt 420 
gtgttcgatg agtataacaa tgacaaacct 480 
gtgacaaggt gttcatctga catcacaggc 540 
caagaaaact gtattattag taagcttttt 600 
acccaaaatg caactgcatc aataatgttc 660 
tgtaatgcaa gtacccacaa ccaagaagca 720 
agaagtgcat gggatgtaat cacagactct 7 80 
gggactgagc ttccacctcc tcccacattc 840 
tgtttagtgc tggatgtgtc cagcaagatg 900 
caagccgcag aattttattt gatgcagatt 960 
agtttcgaca gcaaaggaga gatcagagcc 1020 
cgaaagttgc tggtttcata tctgcccacc 1080 
tgttcagggc ttaagaaagg atttgaggtg 1140 
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gttgaaaaac tgaatggaaa agcttatggc tctgtgatga tattagtgac cagcggagat 1200 

gataagcttc ttggcaattg cttacccact gtgctcagca gtggttcaac aattcactcc 1260 

attgccctgg gttcatctgc agccccaaat ctggaggaat tatcacgtct tacaggaggt 1320 
ttaaagttct ttgttccaga tatatcaaac tccaatagca tgattgatgc tttcagtaga 1380 
atttcctctg gaactggaga cattttccag caacatattc agcttgaaag tacaggtgaa 1440 

aatgtcaaac ctcaccatca attgaaaaac acagtgactg tggataatac tgtgggcaac 1500' 

gacactatgt ttctagttac gtggcaggcc agtggtcctc ctgagattat attatttgat 1560 

cctgatggac gaaaatacta cacaaataat tttatcacca atctaacttt tcggacagct 1620 

agtctttgga ttccaggaac agctaagcct gggcactgga cttacaccct gaacaatacc 1680 

catcattctc tgcaagccct gaaagtgaca gtgacctctc gcgcctccaa ctcagctgtg 174 0 

cccccagcca ctgtggaagc ctttgtggaa agagacagcc tccattttcc tcatcctgtg 1800 

atgatttatg ccaatgtgaa acagggattt tatcccattc ttaatgccac tgtcactgcc 1860 

acagttgagc cagagactgg agatcctgtt acgctgagac tccttgatga tggagcaggt 1920 

gctgatgtta taaaaaatga tggaatttac tcgaggtatt ttttctcctt tgctgcaaat 1980 

ggtagatata gcttgaaagt gcatgtcaat cactctccca gcataagcac cccagcccac 2040 

tctattccag ggagtcatgc tatgtatgta ccaggttaca cagcaaacgg taatattcag 2100 

atgaatgctc caaggaaatc agtaggcaga aatgaggagg agcgaaagtg gggctttagc 2160 

cgagtcagct caggaggctc cttttcagtg ctgggagttc cagctggccc ccaccctgat 2220 

gtgtttccac catgcaaaat tattgacctg gaagctgtaa aagtagaaga ggaattgacc 2280 

ctatcttgga cagcacctgg agaagacttt gatcagggcc aggctacaag ctatgaaata 2340 

agaatgagta aaagtctaca gaatatccaa gatgacttta acaatgctat tttagtaaat 2400 

acatcaaagc gaaatcctca gcaagctggc atcagggaga tatttacgtt ctcaccccaa 2460 

atttccacga atggacctga acatcagcca aatggagaaa cacatgaaag ccacagaatt 2520 

tatgttgcaa tacgagcaat ggataggaac tccttacagt ctgctgtatc taacattgcc 2580 

caggcgcctc tgtttattcc ccccaattct gatcctgtac ctgccagaga ttatcttata 2640 

ttgaaaggag ttttaacagc aatgggtttg ataggaatca tttgccttat tatagttgtg 2700 

acacatcata ctttaagcag gaaaaagaga gcagacaaga aagagaatgg aacaaaatta 2760 
ttataatgaa ttc 2773 

<210> 359 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 359 

tggcagcccc tcttcttcaa gtggc 25 

<210> 360 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR primer 
<400> 360 

cgccagaatt catcaaacaa atctgttagc acc 

<210> 361 
<211> 77 
<212> PRT 

<213> Homo sapiens 



33 



<400> 361 

Met Gin His His His His His His Trp Gin Pro Leu Phe Phe Lys Trp 
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15 10 15 

Leu Leu Ser Cys Cys Pro Gly Ser Ser Gin lie Ala Ala Ala Ala Ser 

20 25 30 

Thr Gin Pro Glu Asp Asp lie Asn Thr Gin Arg Lys Lys Ser Gin Glu 

35 40 45 

Lys Met Arg Glu Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu Thr 

50 " 55 €0 

lie Pro Gin Thr Ser Ser His Gly Ala Asn Arg Phe Val 
65 70 75 



<210> 362 
<211> 244 
<212> DNA 
<213> Homo sapiens 



<400> 362 

catatgcagc atcaccacca tcaccactgg 
tgttgccctg ggagttctca aattgctgca 
aatacacaga ggaagaagag tcaggaaaag 
ccccgagagc ttaccattcc tcagacttct 
attc 



cagcccctct tcttcaagtg gctcttgtcc 60 
gcagcctcca cccagcctga ggatgacatc 120 
atgagagaag ttacagactc tcctgggcga 180 
tc'acatggtg ctaacagatt tgtttgatga 240 

244 



<210> 363 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 363 

Met Trp Gin Pro Leu Phe Phe Lys 

1 5 
Ser Ser Gin lie 
20 



Trp Leu Leu Ser Cys Cys Pro Gly 
10 15 



<210> 364 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<400> 364 

atgtggcagc ccctcttctt caagtggctc ttgtcctgtt gccctgggag ttctcaaatt 60 



<210> 365 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 365 

Gly Ser Ser Gin He Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp 

15 10 15 

He Asn Thr Gin 
20 



<210> 366 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<400> 366 

gggagttctc aaattgctgc agcagcctcc acccagcctg aggatgacat caatacacag 60 



<210> 367 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 367 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

1 5 10 15 

Gin Ala Leu Lys 
20 



<210> 368 

<211> 2343 

<212> DNA 

<213> Homo sapiens 

<400> 368 

attccggagc gtttgcggct tcgcttcatg 
gggagctggg gagcccgcag cggcccggag 
gcgccgcgcc tctgaggcgc agcatgtgaa 
tctcagccgg ccgggatggc taccacggcc 
gaatatattg aacgtcttgt atggagaacc 
gcttttgatc ctaaaagatt attagaagaa 
atggatgaaa ggattcagag gaaagtagag 
aaggaatttg ccaagaaggt acaagagctg 
ttccaagaac tagatgagca cattagctat 
cagttagagg gggtaaacac acccagacaa 
tactttaatg agtttctaga tggagaattg 
ataaaggaag cagcagacat cattcagaag 
gatagatttt cagaagttaa atccaaaatt 
ctgattcagg agtttaccag tgctcaaaga 
gcagcagttt tacttcattt taagggttat 
tgccaggagg gtgcttattt gagaaatgat 
agagtgaaca aacaagttgg agatatcttc 
attcaaaatg tatttgaaat caaactacag 
aggaagtccg atgcagagca atatctcaaa 
aatctttcca gcaagctgat ggagtttaat 
aagcttatca aatccatttt catttcctat 
tatttgaaaa gcagaagtgc tatgatccta 
aagagatcca ttggcacagg aggtattcaa 
aacttaccac ttgggccaag tatcgatact 
gtggttaatc ttttacaaga aaccaaacaa 
ccttctgact taccaaggaa tgccttcaga 
attgagcata ttgattatgc tttggaaaca 
aggaatgcaa atctttattt tttggacgtt 
tttgacaaac agtttaatga tcaccttatg 
gaatgccttc agaagaaaaa agaaataatt 
attgatagga cattaaattg tatgattgga 
aagaaaacag attttaagcc agaagatgaa 
tgtgtaaaag tctgtgctta cgtaagaaaa 



gccgctctcc cgcccctcct gggatctgtg 60 
ccggagctgg cgagccgagc ggagacctgt 120 
gcggagacgg catccagtgg ggggcgagcc 180 
gagctcttcg aggagccttt tgtggcagat 240 
ccaggaggag gctctagagg tggacctgaa 300 
tttgtaaatc atattcagga actccagata 360 
aaactagagc aacaatgtca gaaagaagcc 42 0 
cagaaaagca atcaggttgc cttccaacat 480 
gtagcaacta aagtctgtca ccttggagac 54 0 
cgggcagtgg aggctcagaa attgatgaaa 600 
aaatctgatg tttttacaaa ttctgaaaag 660 
ttgcacctaa ttgcccaaga gttacctttt 720 
gcaagtaaat accatgattt agaatgccag 780 
agaggtgaaa tctccagaat gagagaagta 840 
tcccattgtg ttgatgttta tataaagcag 900 
atatttgaag acgctggaat actctgtcaa 960 
agtaatccag aaacagtcct ggctaaactt 1020 
agttttgtga aagagcagtt agaagaatgt 1080 
aatctctatg atctgtatac aagaaccacc 1140 
ttaggtactg ataaacagac tttcttgtct 1200 
ttggagaact atattgaggt ggagactgga 1260 
cagcgctatt atgattcgaa aaaccatcaa 1320 
gatttgaagg aaagaattag acagcgtacc 1380 
catggggaga cttttctatc ccaagaagtg 1440 
gcctttgaaa gatgtcatag gctctctgat 1500 
atttttacca ttcttgtgga atttttatgt 1560 
ggacttgctg gaattccctc ttcagattct 1620 
gtgcaacagg ccaatactat ttttcatctt 1680 
ccactaataa gctcttctcc taagttatct 1740 
gaacaaatgg agatgaaatt ggatactggc 1800 
cagatgaagc atattttggc tgcagaacag 1860 
aacaatgttt tgattcaata tactaatgcc 1920 
caagtggaga agattaaaaa ttccatggat 1980 
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gggaagaatg tggatacagt tttgatggaa cttggagtac gttttcatcg acttatctat 2040 

gagcatcttc aacaatattc ctacagttgt atgggtggca tgttggccat ttgtgatgta 2100 

gccgaatata ggaagtgtgc caaagacttc aagattccaa tggtattaca tctttttgat 2160 

actctgcatg ctctttgcaa tcttctggta gttgccccag ataatttaaa gcaagtctgc 2220 

tcaggagaac aacttgctaa tctggacaag aatatacttc actccttcgt acaacttcgt 2280 

gctgattata gatctgcccg ccttgctcga cacttcagct gagattgaat ttacaaagga 2340 

att - ■ ■ - 2343 

<210> 369 
<211> 708 
<212> PRT 

<213> Homo sapiens 
<400> 369 

Met Ala Thr Thr Ala Glu Leu Phe Glu Glu Pro Phe Val Ala Asp Glu 

1 5 10 15 

Tyr lie Glu Arg Leu Val Trp Arg Thr Pro Gly Gly Gly Ser Arg Gly 

20 25 30 

Gly Pro Glu Ala Phe Asp Pro Lys Arg Leu Leu Glu Glu Phe Val Asn 

35 40 45 

His lie Gin Glu Leu Gin lie Met Asp Glu Arg He Gin Arg Lys Val 

50 55 60 

Glu Lys Leu Glu Gin Gin Cys Gin Lys Glu Ala Lys Glu Phe Ala Lys 
65 70 75 80 

Lys Val Gin Glu Leu Gin Lys Ser Asn Gin Val Ala Phe Gin His Phe 

85 90 95 

Gin Glu Leu Asp Glu His He Ser Tyr Val Ala Thr Lys Val Cys His 

100 105 110 

Leu Gly Asp Gin Leu Glu Gly Val Asn Thr Pro Arg Gin Arg Ala Val 

115 120 125 

Glu Ala Gin Lys Leu Met Lys Tyr Phe Asn Glu Phe Leu Asp Gly Glu 

130 135 140 

Leu Lys Ser Asp Val Phe Thr Asn Ser Glu Lys He Lys Glu Ala Ala 
145 " 150 155 160 

Asp He He Gin Lys Leu His Leu He Ala Gin Glu Leu Pro Phe Asp 

165 170 175 

Arg Phe Ser Glu Val Lys Ser Lys He Ala Ser Lys Tyr His Asp Leu 

180 ' 185 190 

Glu Cys Gin Leu He Gin Glu Phe Thr Ser Ala Gin Arg Arg Gly Glu 

195 200 , 205 

He Ser Arg Met Arg Glu Val Ala Ala Val Leu Leu His Phe Lys Gly 

210 215 220 

Tyr Ser His Cys Val Asp Val Tyr He Lys Gin Cys Gin Glu Gly Ala 
225 230 235 240 

Tyr Leu Arg Asn Asp He Phe Glu Asp Ala Gly He Leu Cys Gin Arg 

245 250 255 

Val Asn Lys Gin Val Gly Asp He Phe Ser Asn Pro Glu Thr Val Leu 

260 265 270 

Ala Lys Leu He Gin Asn Val Phe Glu He Lys Leu Gin Ser Phe Val 

275 280 285 

Lys Glu Gin Leu Glu Glu Cys Arg Lys Ser Asp Ala Glu Gin Tyr Leu 

290 295 300 

Lys Asn Leu Tyr Asp Leu Tyr Thr Arg Thr Thr Asn Leu Ser Ser Lys 
305 ' 310 315 320 

Leu Met Glu Phe Asn Leu Gly Thr Asp Lys Gin Thr Phe Leu Ser Lys 

325 330 335 

Leu He Lys Ser He Phe He Ser Tyr Leu Glu Asn Tyr He Glu Val 
340 345 350 
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Glu 


Thr 


Gly 
355 


Tyr 


Leu 


Lys Ser 


Arg 
360 


Ser 


Ala 


Met 


He 


Leu Gin 
365 


Arg 


Tyr 


Tyr 


Asp 
370 


Ser 


Lys 


Asn 


His Gin 
375 


Lys 


Arg 


Ser 


He 


Gly 
380 


Thr Gly 


Gly 


He 


Gin 


Asp 


Leu 


Lys 


Glu 


Arg He 


Arg 


Gin 


Arg 


Thr 


Asn 


Leu Pro 


Leu 


Gly 


385 










390 








395 








400 


Pro 


Ser 


He 


Asp 


Thr 
405 


His Gly 


Glu 


Thr 


Phe 
410 


Leu 


Ser 


Gin Glu 


Val 
415 


Val 


Val 


Asn 


Leu 


Leu 
420 


Gin 


Glu Thr 


Lys 


Gin 
425 


Ala 


Phe 


Glu 


Arg Cys 
430 


His 


Arg 


Leu 


Ser 


Asp 
435 


Pro 


Ser 


Asp Leu 


Pro 
440 


Arg 


Asn 


Ala 


Phe 


Arg He 
445 


Phe 


Thr 


He 


Leu 
450 


Val 


Glu 


Phe 


Leu Cys 
455 


He 


Glu 


His 


He 


Asp 
460 


Tyr ' Ala 


Leu 


Glu 


Thr 


Gly 


Leu 


Ala 


Gly 


He Pro 


Ser 


Ser 


Asp 


Ser 


Arg 


Asn Ala 


Asn 


Leu 


465 










470 








475 








480 


Tyr 


Phe 


Leu 


Asp 


Val 
485 


Val Gin 


Gin 


Ala 


Asn 
490 


Thr 


He 


Phe His 


Leu 
495 


Phe 


Asp 


Lys 


Gin 


Phe 
500 


Asn 


Asp His 


Leu 


Met 
505 


Pro 


Leu 


He 


Ser Ser 
510 


Ser 


Pro 


Lys 


Leu 


Ser 
515 


Glu 


Cys 


Leu Gin 


Lys 
520 


Lys 


Lys 


Glu 


He 


He Glu 
525 


Gin 


Met 


Glu 


Met 
530 


Lys 


Leu 


Asp 


Thr Gly 
535 


lie 


Asp 


Arg 


Thr 


Leu 
540 


Asn Cys 


Met 


He 


Gly 


Gin 


Met 


Lys 


His 


lie Leu 


Ala 


Ala 


Glu 


Gin 


Lys 


Lys Thr 


Asp 


Phe 


545 










550 








555 








560 


Lys 


Pro 


Glu 


Asp 


Glu 
565 


Asn Asn 


Val 


Leu 


He 
570 


Gin 


Tyr 


Thr Asn 


Ala 
575 


Cys 


Val 


Lys 


Val 


Cys 
580 


Ala 


Tyr Val 


Arg 


Lys 
585 


Gin 


Val 


Glu 


Lys He 
590 


L y s 


Asn 


Ser 


Met 


Asp 
595 


Gly 


Lys 


Asn Val 


Asp 
600 


Thr 


Val 


Leu 


Met 


Glu Leu 
605 


Gly 


Val 


Arg 


Phe 
610 


His 


Arg 


Leu 


He Tyr 
615 


Glu 


His 


Leu 


Gin 


Gin 
620 


Tyr Ser 


Tyr 


Ser 


Cys 


Met 


Gly 


Gly 


Met 


Leu Ala 


He 


Cys 


Asp 


Val 


Ala 


Glu Tyr 


Arg 


Lys 


625 










630 








635 








640 


Cys 


Ala 


Lys 


Asp 


Phe 
645 


Lys He 


Pro 


Met' 


Val 
650 


Leu 


His 


Leu Phe 


Asp 
655 


Thr 


Leu 


His 


Ala 


Leu 
660 


Cys 


Asn Leu 


Leu 


Val 
665 


Val 


Ala 


Pro 


Asp Asn 
670 


Leu 


Lys 


Gin 


Val 


Cys 
675 


Ser 


Gly 


Glu Gin 


Leu 
680 


Ala 


Asn 


Leu 


Asp 


Lys Asn 
685 


He 


Leu 


His 


Ser 
690 


Phe 


Val 


Gin 


Leu Arg 
695 


Ala 


Asp 


Tyr 


Arg 


Ser 
700 


Ala Arg 


Leu 


Ala 


Arg 


His 


Phe 


Ser 























705 



<210> 370 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 370 

gtcaatcact ctcccagcat aagcacccca gcccactcta ttccagggag tcatgctatg 60 



<210> 371 
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<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 371 

agtagaattt cctctggaac tggagacatt ttccagcaac atattcagct tgaaagtaca 60 



<210> 372 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 372 

ccagagactg gagatcctgt tacgctgaga ctccttgatg atggagcagg tgctgatgtt 60 

<210> 373 

<2U> 60 

<212> DNA 

<213> Homo sapiens 

<400> 373 

ttacagtctg ctgtatctaa cattgcccag gcgcctctgt ttattccccc caattctgat 60 



<210> 374 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 374 

gctgtgcccc cagccactgt ggaagccttt gtggaaagag acagcctcca ttttcctcat 60 



<210> 375 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 375 

aaaaacacag tgactgtgga taatactgtg ggcaacgaca ctatgtttct agttacgtgg 60 



<210> 376 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 376 

Leu Gin Ser Ala Val Ser Asn He Ala Gin Ala Pro Leu Phe He Pro 

15 10 15 

Pro Asn Ser Asp 
20 



<210> 377 
<211> 20 
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<212> PRT 

<213> Homo sapiens 
<400> 377 

Val Asn His Ser Pro Ser lie Ser Thr Pro Ala His Ser lie Pro Gly 

15 10 15 

Ser His Ala Met 
20 



<210> 378 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 378 

Pro Glu Thr Gly Asp Pro Val Thr Leu Arg Leu Leu Asp Asp Gly Ala 

15 10 15 

Gly Ala Asp Val 
20 



<210> 379 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 379 

Ala Val Pro Pro Ala Thr Val Glu Ala Phe Val Glu Arg Asp Ser Leu 

15 10 15 

His Phe Pro His 
20 



<210> 380 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 380 

Ser Arg He Ser Ser Gly Thr Gly Asp He Phe Gin Gin His He Gin 

1 5 10 15 

Leu Glu Ser Thr 
20 



<210> 381 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 381 

Lys Asn Thr Val Thr Val Asp Asn Thr Val Gly Asn Asp Thr Met Phe 

15 10 15 

Leu Val Thr Trp 
20 
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<210> 382 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 382 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

15 10 15 

Gin Ala Leu Lys 
20 



<210> 383 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 383 

cggcgaattc atggattggg ggacgctgc 

<210> 384 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 384 

cggcctcgag tcacccctct atccgaacct tctgc 

<210> 385 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 385 

cggcgaattc cacgaaccac tcgcaagttc ag 

<210> 386 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 386 

cggctcgagt tagcttgggc ctgtgattgc 



<210> 387 
<211> 20 
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<212> PRT 

<213> Homo sapiens 
<400> 387 

Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly Ser Ser Gin lie Ala 

1 5 10 15 

Ala Ala Ala Ser 
20 



<210> 388 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 388 

Leu Ser Cys Cys Pro Gly Ser Ser Gin He Ala Ala Ala Ser Thr Gin 

15 10 15 

Pro Glu Asp 



<210> 389 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 389 

Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp He Asn Thr Gin Arg 

15 10 15 

Lys Lys Ser Gin 
20 



<210> 390 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 390 

Thr Gin Pro Glu Asp Asp He Asn Thr Gin Arg Lys Lys Ser Gin Glu 

1 5 10 15 

Lys Met Arg Glu 
20 



<210> 391 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 391 

Asp He Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val 

1 5 10 15 

Thr Asp Ser Pro 
20 
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<210> 392 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 392 

Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp Ser Pro Gly 

15 10 15 

Arg Pro Arg Glu 
20 



<210> 393 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 393 

Glu Lys Met Arg Glu Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu 

15 10 15 

Thr lie Pro Gin 
20 



<210> 394 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 394 

Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu Thr lie Pro Gin Thr 

15 10 15 

Ser Ser His Gly 
20 



<210> 395 

<211> 19 

<212> PRT 

<213> Homo sapiens 



<400> 395 

Gly Arg Pro Arg Glu Leu Thr lie Pro Gin Thr Ser Ser His Gly Ala 

15 10 15 

Asn Arg Phe 



<210> 396 

<211> 19 

<212> PRT 

<213> Homo sapiens 



<400> 396 

Met Asn Lys Leu Tyr lie Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 

15 10 15 

Asp Leu Glu 
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<210> 397 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 397 

Ser Glu Asn Ala Ala Pro Ser Asp Leu Glu Ser lie Phe Lys Asp Ala 

1 5 10 15 

Lys He Pro Val 
20 



<210> 398 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 398 

Ser He Phe Lys Asp Ala Lys He Pro Val Ser Gly Pro Phe Leu Val 

15 .10 15 

Lys Thr Gly Tyr 
20 



<210> 399 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 399 

Ser Gly Pro Phe Leu Val Lys Thr 

1 5 
Asp Glu Ser Trp 
20 



Gly Tyr Ala Phe Val Asp Cys Pro 
10 15 



<210> 400 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 400 

Ala Phe Val Asp Cys Pro Asp Glu 

1 5 
Ala Leu Ser Gly 
20 



Ser Trp Ala Leu Lys Ala He Glu 
10 15 



<210> 401 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 401 

Ala Leu Lys Ala He Glu Ala Leu Ser Gly Lys He Glu Leu His Gly 
15 10 15 
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173 



Lys Pro lie Glu 
20 



<210> 402 • 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 402 

Lys lie Glu Leu His Gly Lys Pro lie Glu Val Glu His Ser Val Pro 

1 5 10 15 

Lys Arg Gin Arg 
20 



<210> 403 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 403 

Val Glu His Ser Val Pro Lys Arg Gin Arg lie Arg Lys Leu Gin lie 

1 5 10 15 

Arg Asn lie Pro 
20 



<210> 404 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 404 

lie Arg Lys Leu Gin lie Arg Asn lie Pro Pro His Leu Gin Trp Glu 

15 10 15 

Val Leu Asp Ser 
20 



<210> 405 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 405 

Ala Val Val Asn Val Thr Tyr Ser Ser Lys Asp Gin Ala Arg Gin Ala 

1 5 10 15 

Leu Asp Lys Leu 
20 



<210> 406 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 406 
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Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu Glu 

1 5 ' " 10 15 

Asn Phe Thr Leu 
20 



<210> 407 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 407 

Asn Gly Phe Gin Leu Glu Asn Phe Thr Leu Lys Val Ala Tyr lie Pro 

1 5 10 15 

Asp Glu Thr Ala 
20 



<210> 408 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 408 

Lys Val Ala Tyr lie Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Leu 

15 10 15 

Gin Gin Pro Arg 
20 



<210> 409 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 409 

Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly 

1 5 10 15 ' 

Gin Arg Gly Ser 
20 



<210> 410 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 410 

Gly Arg Arg Gly Leu Gly Gin Arg Gly Ser Ser Arg Gin Gly Ser Pro 

15 10 15 

Gly Ser Val Ser 
20 



<210> 411 

<211> 20 

<212> PRT 

<213> Homo sapiens 
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<400> 411 

Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys Pro Cys Asp 

1 5 10 15 

Leu Pro Leu Arg 
20 



<210> 412 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 412 

Lys Gin Lys Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin 

1 5 10 15 

Phe Val Gly Ala 
20 



<210> 413 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 413 

Leu Leu Val Pro Thr Gin Phe Val Gly Ala lie lie Gly Lys Glu Gly 

15 10 15 

Ala Thr He Arg 
20 



<210> 414 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 414 

He He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin Thr 

1 5 10 15 

Gin Ser Lys He 
20 



<210> 415 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 415 

Asn He Thr Lys Gin Thr Gin Ser Lys He Asp Val His Arg Lys Glu 

1 5 10 15 

Asn Ala Gly Ala 
20 



<210> 416 
<211> 20 
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<212> PRT 

<213> Homo sapiens 
<400> 416 

Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala Glu Lys Ser lie Thr 

15 10 15 

lie Leu Ser Thr 
20 



<210> 417 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 417 

Ala Glu Lys Ser lie Thr lie Leu 

1 5 
Ala Cys Lys Ser 
20 



Ser Thr Pro Glu Gly Thr Ser Ala 
10 ( 15 • 



<210> 418 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 418 

Pro Glu Gly Thr Ser Ala Ala Cys Lys Ser lie Leu Glu He Met His 

15 10 15 

Lys Glu Ala Gin 
20 



<210> 419 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 419 

He Leu Glu He Met His Lys Glu Ala Gin Asp He Lys Phe Thr Glu 

1 5 10 " "* 15 

Glu He Pro Leu 
20 



<210> 420 
<211> 455 
<212> DNA 
<213> Homo sapiens 

<400> 420 

gaagacatgc ttacttcccc ttcaccttcc ttcatgatgt gggaagagtg ctgcaaccca 60 
gccctagcca acgccgcatg agagggagtg tgccgagggc ttctgagaag gtttctctca 120 
catctagaaa gaagcgctta agatgtggca gcccctcttc ttcaagtggc tcttgtcctg 180 
ttgccctggg agttctcaaa ttgctgcagc agcctccacc cagcctgagg atgacatcaa 240 
tacacagagg aagaagagtc aggaaaagat gagagaagtt acagactctc ctgggcgacc 300 
ccgagagctt accattcctc agacttcttc acatggtgct aacagatttg ttcctaaaag 360 
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taaagctcta gaggccgtca aattggcaat agaagccggg ttccaccata ttgattctgc 420 
acatgtttac aataatgagg agcaggttgg actgg 4 55 

<210> 421 . 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 421 

actagtgtcc gcgtggcggc ctac 24 

<210> 422 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 422 

catgagaatt catcacatgc ccttgaaggc tccc 34 

<210> 423 
<211> 161 
<212> PRT 

<213> Homo sapiens 



<400> 423 



Met 


Gin 


His His 


His His 


His 


His 


His 


Thr Ser Val Arg Val Ala Ala 


1 






5 








10 15 


Tyr 


Phe 


Glu Asn 


Phe Leu 


Ala 


Ala 


Trp 


Arg Pro Val Lys Ala Ser Asp 




20 








25 


30 


Gly 


Asp 


Tyr Tyr 


Thr Leu 


Ala 


Val 


Pro 


Met Gly Asp Val Pro Met Asp 




35 






40 




45 


Gly 


He 


Ser Val 


Ala Asp 


He 


Gly 


Ala 


Ala Val Ser Ser He Phe Asn 


50 






55 






60 


Ser 


Pro 


Glu Glu 


Phe Leu 


Gly 


Lys 


Ala 


Val Gly Leu Ser Ala Glu Ala 


65 






70 








75 80 


Leu 


Thr 


He Gin 


Gin Tyr 


Ala 


Asp 


Val 


Leu Ser Lys Ala Leu Gly Lys 








85 








90 95 


Glu 


Val 


Arg Asp 


Ala Lys 


lie 


Thr 


Pro 


Glu Ala Phe Glu Lys Leu Gly 






100 








105 


110 


Phe 


Pro 


Ala Ala 


Lys Glu 


He 


Ala 


Asn 


Met Cys Arg Phe Tyr Glu Met 






115 






120 




125 


Lys 


Pro 


Asp Arg 


Asp Val 


Asn 


Leu 


Thr 


His Gin Leu Asn Pro Lys Val 


130 




135 






140 


Lys 


Ser 


Phe Ser 


Gin Phe 


He 


Ser 


Glu 


Asn Gin Gly Ala Phe Lys Gly 


145 






150 








155 160 



Met 



<210> 424 
<211> 489 
<212> DNA 
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<213> Homo sapiens 
<400> 424 

atgcagcatc accaccatca ccaccacact agtgtccgcg tggcggccta ctttgaaaac 60 
tttctcgcgg cgtggcggcc cgtgaaagcc tctgatggag attactacac cttggctgta 120 
ccgatgggag atgtaccaat ggatggtatc tctgttgctg atattggagc agccgtctct 180 
agcattttta attctccaga ggaattttta ggcaaggccg tggggctcag tgcagaagca 240 
ctaacaatac agcaatatgc tgatgttttg tccaaggctt tggggaaaga agtccgagat 300 
gcaaagatta ccccggaagc tttcgagaag ctgggattcc ctgcagcaaa ggaaatagcc 360 
aatatgtgtc gtttctatga aatgaagcca gaccgagatg tcaatctcac ccaccaacta 420 
aatcccaaag tcaaaagctt cagccagttt atctcagaga accagggagc cttcaagggc 480 
atgtgatga 489 

<210> 425 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 425 

aacaaactgt atatcggaaa cctcagcgag aa 32 

<210> 426 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 426 

ccatagaatt cattacttcc gtcttgactg agg 33 

<210> 427 
<211> 586 
<212> PRT 

<213> Homo sapiens , 



<400> 427 




















Met 


Gin 


His 


His 


His 


His 


His 


His 


Asn 


Lys 


Leu Tyr He Gly Asn Leu 


1 








5 










10 


15 




Ser 


Glu 


Asn 


Ala 


Ala 


Pro 


Ser 


Asp 


Leu 


Glu 


Ser He Phe Lys Asp 


Ala 








20 










25 




30 




Lys 


He 


Pro 


Val 


Ser 


Gly Pro 


Phe 


Leu 


Val 


Lys Thr Gly Tyr Ala 


Phe 






35 










40 






.-45 




Val 


Asp 


Cys 


Pro 


Asp* 


Glu 


Ser 


Trp 


Ala 


Leu Lys Ala He Glu Ala Leu 




50 










55 








60 




Ser 


Gly 


Lys 


He 


Glu 


Leu 


His 


Gly 


Lys 


Pro 


He Glu Val Glu His 


Ser 


65 










70 










75 


80 


Val 


Pro 


Lys 


Arg 


Gin 


Arg He Arg 


Lys 


Leu 


Gin He Arg Asn He 


Pro 










85 










90 


95 




Pro 


His 


Leu 


Gin 


Trp 


Glu 


Val 


Leu 


Asp 


Ser 


Leu Leu Val Gin Tyr 


Gly 








100 










105 




110 




Val 


Val 


Glu 


Ser 


Cys 


Glu 


Gin 


Val 


Asn 


Thr Asp Ser Glu Thr Ala 


Val 






115 










120 






125 




Val 


Asn 


Val 


Thr 


Tyr 


Ser 


Ser 


Lys 


Asp 


Gin 


Ala Arg Gin Ala Leu 


Asp 
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130 



Lvs 


Leu 


Asn 


Glv 


145 








lie 


Pro 


Asp 


Glu 


Gly 


Arg 


Arg 


Gly 








180 


Gly 


Ser 


Val 


Ser 






195 




Val 


Pro 


Thr 


Gin 




210 






lie 


Ara 


Asn 


He 


225 








Lys 


Glu 


Asn 


Ala 


Pro 


Glu 


Gly 


Thr 








260 


Lys 


Glu 


Ala 


Gin 






275 




Leu 


Ala 


His 


Asn 




290 






Asn 


Leu 


Lys 


Lys 


305 




r 




Pro 


Leu 


Gin 


Glu 


Lys 


Gly 


Asn 


Val 








340 


Lys 


He 


Arg 


Glu 






355 




Ala 


His 


Leu 


He 




370 






Pro 


Thr 


Ser 


Glv 


385 








Thr 


Pro 


Pro 


Tyr 


Leu 


Phe 


He 


Pro 








420 


Gin 


His 


He 


Lys 






435 




Ala 


Pro 


Ala 


Glu 




450 






Glv 


Pro 


Pro 


Glu 


465 








lie 


Lys 


Glu 


Glu 


Ala 


His 


He 


Arg 








500 


Gly 


Gly 


Lys 


Thr 






515 




Val 


Val 


Pro 


Arg 




530 






Lys 


He 


Thr 


Gly 


545 








Gin 


Glu 


He 


Leu 


Gin 


Ser 


Gly 


Pro 



580 



135 




Phe Gin Leu 


Glu 


150 




Thr Ala Ala 


Gin 


165 




Leu Gly Gin 


Arg 


Lys Gin Lys 


Pro 




200 


Phe Val Gly 


Ala 


215 




Thr Lys Gin 


Thr 


230 




Gly Ala Ala 


Glu 


245 




Ser Ala Ala 


Cys 


Asp lie Lys 


Phe 




280 


Asn Phe Val 


Gly 


295 




He Glu Gin 


Asp 


310 




Leu Thr Leu 


Tyr 


325 




Glu Thr Cys 


Ala 


Ser Tyr Glu 


Asn 




360 


Pro Gly Leu 


Asn 


375 




Met Pro Pro 


Pro 


390 




Pro Gin Phe 


Glu 


405 




Ala Leu Ser 


Val 


Gin Leu Ser 


Arg 




440 


Ala Pro Asp 


Ala 


455 




Ala Gin Phe 


Lys 


470 




Asn Phe Val 


Ser 


485 




Val Pro Ser 


Phe 


Val Asn Glu 


Leu 




520 


Asp Gin Thr 


Pro 


535 




His Phe Tyr 


Ala 


550 




Thr Gin Val 


Lys 


565 




Pro Gin Ser 


Arg 



Asn 


Phe 


Thr 


Leu 






155 




Gin 


Asn 


Pro 


Leu 




170 






Gly 


Ser 


Ser 


Arg 


185 








Cys 


Asp 


Leu 


Pro 


He 


He 


Gly 


Lys 








220 


Gin 


Ser 


Lys 


He 






235 




Lys 


Ser 


He 


Thr 




250 






Lvs 


Ser 


He 


Leu 


265 








Thr 


Glu 


Glu 


He 


Arg 


Leu 


He 


Gly 








300 


Thr 


Asp 


Thr 


Lys 






315 




Asn 


Pro 


Glu 


Arg 




330 






Lys 


Ala 


Glu 


Glu 


345 








Asp 


He 


Ala 


Ser 


Leu 


Asn 


Ala 


Leu 








380 


Thr 


Ser 


Gly 


Pro 






395 




Gin 


Ser 


Glu 


Thr 




410 








Ala 


Tip 
lie 


lie 


425 








Phe 


Ala 


Gly 


Ala 


Lys 


Val 


Arg 


Met 








460 


Ala 


Gin 


Gly 


Arg 






475 




Pro 


Lys 


Glu 


Glu 




490 






Ala 


Ala 


Gly 


Arg 


505 








Gin 


Asn 


Leu 


Ser 


Asp 


Glu 


Asn 


Asp 








540 


Cys 


Gin 


Val 


Ala 






555 




Gin 


His 


Gin 


Gin 




570 






Arg 


Lys 







585 



Lys Val 


* 

Ala 


Tyr 








160 


Gin 


Gin 


Pro 


Arg 






175 




Gin Gly 


Ser 


Pro 




190 






Leu Arg 


Leu 


Leu 


205 








Glu Gly 


Ala 


Thr 


Asp Val 


His 


Arg 








240 


lie 


Leu 


Ser 


Thr 






255 




Glu 


He 


Met 


His 




270 






Pro 


Leu 


Lys 


He 


285 








Lys 


Glu 


Gly 


Arg 


He 


Thr 


He 


Ser 








320 


Thr 


He 


Thr 


Val 






335 




Glu 


He 


Met 


Lys 




-sen 

jjU 






Met 


Asn 


Leu 


Gin 


365 








Gly 


Leu 


Phe 


Pro 


Pro 


Ser 


Ala 


Met 








400 


Glu 


Thr 


Val 


His 






415 




Gly 


Lys 


Gin 


Gly 




430 






Ser 


He 


Lys 


He 


445 








Val 


He 


He 


Thr 


He 


Tyr 


Gly 


Lys 








480 


Val 


Lys 


Leu 


Glu 






4 95 




Val 


He 


Gly 


Lys 




510 






Ser 


Ala 


Glu 


Val 


525 








Gin 


Val 


Val 


Val 


Gin 


Arg 


Lys 


He 








560 


Gin 


Lys 


Ala 


Leu 






575 
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<210> 428 
<2li> 1764 
<212> DNA 

<213> Homo sapiens 
<400> 428 

atgcagcatc accaccatca ccacaacaaa ctgtatatcg gaaacctcag cgagaacgcc 60 

gccccctcgg acctagaaag tatcttcaag gacgccaaga tcccggtgtc gggacccttc 120 

ctggtgaaga ctggctacgc gttcgtggac tgcccggacg agagctgggc cctcaaggcc 180 

atcgaggcgc tttcaggtaa aatagaactg cacgggaaac ccatagaagt tgagcactcg 240 

gtcccaaaaa ggcaaaggat tcgg;aaactt cagatacgaa atatcccgcc tcatttacag 300 

tgggaggtgc tggatagttt actagtccag tatggagtgg tggagagctg tgagcaagtg 360 

aacactgact cggaaactgc agttgtaaat gtaacctatt ccagtaagga ccaagctaga 420 

caagcactag acaaactgaa tggatttcag ttagagaatt tcaccttgaa agtagcctat 480 

atccctgatg aaacggccgc ccagcaaaac cccttgcagc agccccgagg tcgccggggg 540 

cttgggcaga ggggctcctc aaggcagggg tctccaggat ccgtatccaa gcagaaacca 600 

tgtgatttgc ctctgcgcct gctggttccc acccaatttg ttggagccat cataggaaaa 660 

gaaggtgcca ccattcggaa catcaccaaa cagacccagt ctaaaatcga tgtccaccgt 720 

aaagaaaatg cgggggctgc tgagaagtcg attactatcc tctctactcc tgaaggcacc 780 

tctgcggctt gtaagtctat tctggagatt atgcataagg aagctcaaga tataaaattc 840 

acagaagaga tccccttgaa gattttagct cataataact ttgttggacg tcttattggt 900 

aaagaaggaa gaaatcttaa aaaaattgag caagacacag acactaaaat cacgatatct 960 

ccattgcagg aattgacgct gtataatcca gaacgcacta ttacagttaa aggcaatgtt 1020 

gagacatgtg ccaaagctga ggaggagatc atgaagaaaa tcagggagtc ttatgaaaat 1080 

gatattgctt ctatgaatct tcaagcacat ttaattcctg gattaaatct gaacgccttg 1140 

ggtctgttcc cacccacttc agggatgcca cctcccacct cagggccccc ttcagccatg 1200 

actcctccct acccgcagtt tgagcaatca gaaacggaga ctgttcatct gtttatccca 1260 

gctctatcag tcggtgccat catcggcaag cagggccagc acatcaagca gctttctcgc 1320 

tttgctggag cttcaattaa gattgctcca gcggaagcac cagatgctaa agtgaggatg 1380 

gtgattatca ctggaccacc agaggctcag ttcaaggctc agggaagaat ttatggaaaa 1440 

attaaagaag aaaactttgt tagtcctaaa gaagaggtga aacttgaagc tcatatcaga 1500 

gtgccatcct ttgctgctgg cagagttatt ggaaaaggag gcaaaacggt gaatgaactt 1560 

cagaatttgt caagtgcaga agttgttgtc cctcgtgacc agacacctga tgagaatgac 1620 

caagtggttg tcaaaataac tggtcacttc tatgcttgcc aggttgccca gagaaaaatt 1680 

caggaaattc tgactcaggt aaagcagcac caacaacaga aggctctgca aagtggacca 1740 

cctcagtcaa gacggaagta atga 1764 

<210> 429 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 429 

ccatggaatt cattatttca atataagata atctc 35 

<210> 430 
<211> 881 
<212> PRT 

<213> Homo sapiens 
<400> 430 

Met Gin His His His His His His Gly Val Gin 

15 10 
Tyr Asn Gly Leu Leu lie Ala lie Asn Pro Gin 



Leu Gin Asp Asn Gly 
15 

Val Pro Glu Asn Gin 
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20 




25 


30 




Asn 


Leu 


He 


Ser 


Asn 


He Lys Glu Met He 


-Thr Glu Ala Ser Phe 


Tyr 






35 






40 


45 




Leu 


Phe 


Asn 


Ala 


Thr 


Lys Arg Arg Val Phe 


Phe Arg Asn He Lys 


He 




50 








55 


60 




Leu 


He 


Pro 


Ala 


Thr 


Trp Lys Ala Asn Asn 


Asn Ser Lys He Lys 


Gin 


65 










70 


75 


80 


Glu 


Ser 


Tyr 


Glu 


Lys 


Ala Asn Val He Val 


Thr Asp Trp Tyr Gly 


Ala 










85 


90 


95 




His 


Gly 


Asp 


Asp 


Pro 


Tyr Thr Leu Gin Tyr 


Arg Gly Cys Gly Lys 


Glu 








100 




105 


110 




Gly 


Lys 


Tyr 


He 


His 


Phe Thr Pro Asn Phe 


Leu Leu Asn Asp Asn 


Leu 






115 






120 


125 




Thr 


Ala 


Gly 


Tyr Gly 


Ser Arg Gly Arg Val 


Phe Val His Glu Trp 


Ala 




130 








135 


140 




His 


Leu 


Arg 


Trp Gly 


Val Phe Asp Glu Tyr 


Asn Asn Asp Lys Pro 


Phe 


145 










150 


155 


160 


Tyr 


He 


Asn 


Gly Gin 


Asn Gin He Lys Val 


Thr Arg Cys Ser Ser 


Asp 










165 


170 


175 




lie 


Thr 


Gly 


He 


Phe 


Val Cys Glu Lys Gly 


Pro Cys. Pro Gin Glu 


Asn 








180 




185 


190 




Cys 


He 


He 


Ser 


Lys 


Leu Phe Lys Glu Gly 


Cys Thr Phe He Tyr 


Asn 






195 






200 


205 




Ser 


Thr 


Gin 


Asn 


Ala 


Thr Ala Ser He Met 


Phe Met Gin Ser Leu 


Ser 




210 








215 


220 




Ser 


Val 


Val 


Glu 


Phe 


Cys Asn Ala Ser Thr 


His Asn Gin Glu Ala 


Pro 


225 










230 


235 


240 


Asn 


Leu 


Gin 


Asn 


Gin 


Met Cys Ser Leu Arg 


Ser Ala Trp Asp Val 


He 










245 


250 


255 




Thr 


Asp 


Ser 


Ala Asp 


Phe His His Ser Phe 


Pro Met Asn Gly Thr 


Glu 








260 




265 


270 




Leu 


Pro 


Pro 


Pro 


Pro 


Thr Phe Ser Leu Val 


Glu Ala Gly Asp Lys 


Val 






275 






280 


285 




Val 


Cys 


Leu 


Val 


Leu 


Asp Val Ser Ser Lys 


Met Ala Glu Ala Asp 


Arg 




290 








295 


300 




Leu 


Leu 


Gin 


Leu 


Gin 


Gin Ala Ala Glu Phe 


Tyr Leu Met Gin He 


Val 


305 










310 


315 


320 


Glu 


He 


His 


Thr 


Phe 


Val Gly He Ala Ser 


Phe Asp Ser Lys Gly 


Glu 










325 


330 


335 




lie 


Arg 


Ala 


Gin 


Leu 


His Gin He Asn Ser 


Asn Asp Asp Arg Lys 


Leu 








340 




345 


350 




Leu 


Val 


Ser 


Tyr 


Leu 


Pro Thr Thr Val Ser 


Ala Lys Thr Asp He 


Ser 






355 






360 


365 




He 


Cys 


Ser 


Gly Leu 


Lys Lys Gly Phe Glu 


Val Val Glu Lys Leu 


Asn 




370 








375 


380 




Gly 


Lys 


Ala 


Tyr Gly 


Ser Val Met He Leu 


Val Thr Ser Gly Asp 


Asp 


n n c 

385 










390 


395 


4 UO 


Lys 


Leu 


Leu 


Gly Asn 


Cys Leu Pro Thr Val 


Leu Ser Ser Gly Ser 


Thr 










405 


410 


415 




He 


His 


Ser 


He 


Ala 


Leu Gly Ser Ser Ala 


Ala Pro Asn Leu Glu 


Glu 








420 




425 


430 




Leu 


Ser 


Arg 


Leu 


Thr 


Gly Gly Leu Lys Phe 


Phe Val Pro Asp He 


Ser 






435 






440 


445 




Asn 


Ser 


Asn 


Ser 


Met 


He Asp Ala Phe Ser 


Arg He Ser Ser Gly 


Thr 




450 








455 


460 




Gly 


Asp 


He 


Phe 


Gin 


Gin His He Gin Leu 


Glu Ser Thr Gly Glu 


Asn 


465 










470 


475 


480 


Val 


Lys 


Pro 


His 


His 


Gin Leu Lys Asn Thr 


Val Thr Val Asp Asn 


Thr 



WO 02/47534 



182 



PCT/US01/47576 









485 




490 


495 


Val 


Gly 


Asn Asp 


Thr Met Phe Leu 


Val 


Thr Trp Gin 


Ala Ser Gly Pro 






500 




505 




510 


Pro 


Glu 


lie He 


Leu Phe Asp Pro 


Asp Gly Arg Lys 


Tyr Tyr Thr Asn 






515 


520 






525 


Asn 


Phe 


He Thr 


Asn Leu Thr Phe 


Arg Thr Ala Ser 


Leu Trp He Pro 




530 




535 




540 




Gly 


Thr 


Ala Lys 


Pro Gly His Trp 


Thr 


Tyr Thr Leu 


Asn Asn Thr His 


545 






550 




555 


560 


His 


Ser 


Leu Gin 


Ala Leu Lys Val 


Thr 


Val Thr Ser 


Arg Ala Ser Asn 








c c c 

565 




570 


575 


Ser 


Ala 


Val Pro 


Pro Ala Thr Val 


Glu 


Ala Phe Val 


Glu Arg Asp Ser 






580 




585 




590 


Leu 


His 


Phe Pro 


His Pro Val Met 


He Tyr Ala Asn 


Val Lys Gin Gly 






595 


600 






605 


Phe 


Tyr 


Pro He 


Leu Asn Ala Thr 


Val 


Thr Ala Thr 


Val Glu Pro Glu 




610 




615 




620 




Thr 


Gly 


Asp Pro 


Val Thr Leu Arg 


Leu Leu Asp Asp 


Gly Ala Gly Ala 


625 






630 




635 


640 


Asp 


Val 


He Lys 


Asn Asp Gly He 


Tyr 


Ser Arg Tyr 


Phe Phe Ser Phe 








645 




650 


655 


Ala 


Ala 


Asn Gly 


Arg Tyr Ser Leu 


Lys Val His Val 


Asn His Ser Pro 






660 




665 




670 


Ser 


lie 


Ser Thr 


Pro Ala His Ser 


He 


Pro Gly Ser 


His Ala Met Tyr 






675 


680 






685 


Val 


Pro 


Gly Tyr 


Thr Ala Asn Gly 


Asn 


He Gin Met 


Asn Ala Pro Arg 




690 




695 




700 




Lys 


Ser 


Val Gly 


Arg Asn Glu Glu 


Glu Arg Lys Trp 


Gly Phe Ser Arg 


705 






710 




715 


720 


Val 


Ser 


Ser Gly 


Gly Ser Phe Ser 


Val 


Leu Gly Val 


Pro Ala Gly Pro- 








725 




730 


735 


His 


Pro 


Asp Val 


Phe Pro Pro Cys 


Lys 


He He Asp 


X — AH T » 1 _ ff _ I 

Leu Glu Ala Val 






740 




745 




750 


Lys 


Val 


Glu Glu 


Glu Leu Thr Leu 


Ser 


Trp Thr Ala 


Pro Gly Glu Asp 






755 


760 






765 


Phe 


Asp 


Gin Gly 


Gin Ala Thr Ser 


Tyr 


Glu He Arg 


Met Ser Lys Ser 




770 




775 




780 




Leu 


Gin 


Asn He 


Gin Asp Asp Phe 


Asn 


Asn Ala He 


Leu Val Asn Thr 


/ DO 






/ yu 




795 


ftnn 

ouu 


Ser 


Lys 


Arg Asn 


Pro Gin Gin Ala 


Gly 


He Arg Glu 


He Phe Thr Phe 








805 




810 


815 


Ser 


Pro 


Gin He 


Ser Thr Asn Gly 


Pro 


Glu His Gin 


Pro Asn Gly Glu 






820 




825 




830 


Thr 


His 


Glu Ser 


His Arg He Tyr 


Val 


Ala He Arg 


Ala Met Asp Arg 






835 


840 






845 


Asn 


Ser 


Leu Gin 


Ser Ala Val Ser 


Asn 


He Ala Gin 


Ala Pro Leu Phe 




850 




855 




860 




lie 


Pro 


Pro Asn 


Ser Asp Pro Val 


Pro Ala Arg Asp 


Tyr Leu lie Leu 


865 






870 




875 


880 


Lys 















<210> 431 

<211> 2646 

<212> DNA 

<213> Homo sapiens 
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<400> 431 

atgcagcatc accaccatca ccacggagta 
ctcattgcaa ttaatcctca ggtacctgag 
atgataactg aagcttcatt ttacctattt 
aatataaaga ttttaatacc tgccacatgg 
gaatcatatg aaaaggcaaa tgtcatagtg 
ccatacaccc tacaatacag agggtgtgga 
aatttcctac tgaatgataa cttaacagct 
catgaatggg cccacctccg ttggggtgtg 
tacataaatg ggcaaaatca aattaaagtg 
tttgtgtgtg aaaaaggtcc ttgcccccaa 
gaaggatgca cctttatcta caatagcacc 
caaagtttat cttctgtggt tgaattttgt 
aacctacaga accagatgtg cagcctcaga 
gactttcacc acagctttcc catgaacggg 
cttgtagagg ctggtgacaa agtggtctgt 
gaggctgaca gactccttca actacaacaa 
gaaattcata ccttcgtggg cattgccagt 
ctacaccaaa ttaacagcaa tgatgatcga 
gtatcagcta aaacagacat cagcatttgt 
gaaaaactga atggaaaagc ttatggctct 
aagcttcttg gcaattgctt acccactgtg 
gccctgggtt catctgcagc cccaaatctg 
aagttctttg ttccagatat atcaaactcc 
tcctctggaa ctggagacat tttccagcaa 
gtcaaacctc accatcaatt gaaaaacaca 
actatgtttc tagttacgtg gcaggccagt 
gatggacgaa aatactacac aaataatttt 
ctttggattc caggaacagc taagcctggg 
cattctctgc aagccctgaa agtgacagtg 
ccagccactg tggaagcctt tgtggaaaga 
atttatgcca atgtgaaaca gggattttat 
gttgagccag agactggaga tcctgttacg 
gatgttataa aaaatgatgg aatttactcg 
agatatagct tgaaagtgca tgtcaatcac 
attccaggga gtcatgctat gtatgtacca 
aatgctccaa ggaaatcagt aggcagaaat 
gtcagctcag gaggctcctt ttcagtgctg 
tttccaccat gcaaaattat tgacctggaa 
tcttggacag cacctggaga agactttgat 
atgagtaaaa gtctacagaa tatccaagat 
tcaaagcgaa atcctcagca agctggcatc 
tccacgaatg gacctgaaca tcagccaaat 
gttgcaatac gagcaatgga taggaactcc 
gcgcctctgt ttattccccc caattctgat 
aaataa 

<210> 432 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 432 

cgcctgctcg agtcattaat attcatcaga 



cagcttcaag acaatgggta taatggattg 60 
aatcagaacc tcatctcaaa cattaaggaa 120 
aatgctacca agagaagagt atttttcaga 180 
aaagctaata ataacagcaa aataaaacaa 240 
actgactggt atggggcaca tggagatgat 300 
aaagagggaa aatacattca tttcacacct 360 
ggctacggat cacgaggccg agtgtttgtc 420 
ttcgatgagt ataacaatga caaacctttc 480 
acaaggtgtt catctgacat cacaggcatt 540 
gaaaactgta ttattagtaa gctttttaaa 600 
caaaatgcaa ctgcatcaat aatgttcatg 660 
aatgcaagta cccacaacca agaagcacca 720 
agtgcatggg atgtaatcac agactctgct 780 
actgagcttc cacctcctcc cacattctcg 840 
ttagtgctgg atgtgtccag caagatggca 900 
gccgcagaat tttatttgat gcagattgtt 960 
ttcgacagca aaggagagat cagagcccag 1020 
aagttgctgg tttcatatct gcccaccact 1080 
tcagggctta agaaaggatt tgaggtggtt 1140 
gtgatgatat tagtgaccag cggagatgat 1200 
ctcagcagtg gttcaacaat tcactccatt 1260 
gaggaattat cacgtcttac aggaggttta 1320 
aatagcatga ttgatgcttt cagtagaatt 1380 
catattcagc ttgaaagtac aggtgaaaat 1440 
gtgactgtgg ataatactgt gggcaacgac 1500 
ggtcctcctg agattatatt atttgatcct 1560 
atcaccaatc taacttttcg gacagctagt 1620 
cactggactt acaccctgaa caatacccat 1680 
acctctcgcg cctccaactc agctgtgccc 1740 
gacagcctcc attttcctca tcctgtgatg 1800 
cccattctta atgccactgt cactgccaca 18 60 
ctgagactcc ttgatgatgg agcaggtgct 1920 
aggtattttt tctcctttgc tgcaaatggt 1980 
tctcccagca taagcacccc agcccactct 2040 
ggttacacag caaacggtaa tattcagatg 2100 
gaggaggagc gaaagtgggg ctttagccga 2160 
ggagttccag ctggccccca ccctgatgtg 2220 
gctgtaaaag tagaagagga attgacccta 2280 
cagggccagg ctacaagcta tgaaataaga 2340 
gactttaaca atgctatttt agtaaataca 2400 
agggagatat ttacgttctc accccaaatt 24 60 
ggagaaacac atgaaagcca cagaatttat 2520 
ttacagtctg ctgtatctaa cattgcccag 2580 
cctgtacctg ccagagatta tcttatattg 2640 

2646 



aaatgg 36 
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<210> 433 
<211> 371 
<212> PRT 

<213> Homo sapiens 



<400> 433 



Met 


bin 


His 


HIS 


His 


His His 


His 


i 








5 






Leu 


Leu 


Ser 


Cys 


Cys 


Pro Gly Ser 








on 








inr 


tain 


fro 


blU 


Asp 


Asp He 


Asn 






°, ^ 








40 


Lys 


Met 


Arg 


Pin 
bill 


val 


Thr Asp 


Ser 












55 




T 1 - 

lie 


Fro 


Gin 


Thr 


Ser 


Ser His 


Gly 


DO 










70 




Lys 


Ala 


Leu 


GlU 


Ala 


Val Lys 


Leu 










o c 

oo 






lie 


Asp 


Ser 


Ala 


His 


Val Tyr 


Asn 








1 UU 








lie 


Arg 


ser 


Lys 


lie 


Ala Asp Gly 






1 1 c 

110 








120 


Tyr 


inr 


C A v> 

oer 


jjys 


T nil 

Leu 


Trp Ser Asn 












135 




flO 


Aia 


Leu 


blU 


TV -v-^r 

Arg 


Ser Leu 


Lys 


1 ID 










150 




Leu 


Tyr 


T an 

Leu 


Ti- 
ne 


His 


Phe Pro 


Val 










Idj 






TI- 


Pro 


Lys 


Asp 


Glu 


Asn Gly Lys 








loU 








Cys 


Ala 


Thr 


Trp 


Glu 


Ala Met 


Glu 






135 








200 


Lys 


oer 


lie 


valy 


val 


Ser Asn 


Phe 




21U 








215 




T an 

jjeu 


Asn 


Lys 


Fro 


Gly 


Leu Lys 


Tyr 












230 




Cys 


HIS 


Pro 


Tyr 


Fne 


Asn Gin Arg 










245 






Lys 


Asp 


He 


Val 


Leu 


Val Ala 


Tyr 








260 








Glu 


Pro 


Trp 


Val 


Asp 


Pro Asn 


Ser 






275 








280 


Leu 


Cys 


Ala 


Leu 


Ala 


Lys Lys 


His 




290 








295 




Leu 


Arg 


Tyr 


Gin 


Leu 


Gin Arg 


Gly 


305 










310 




Asn 


Glu 


Gin 


Arg 


He 


Arg Gin 


Asn 










325 






Thr 


Ser 


Glu 


Glu 


Met 


Lys Ala 


He 








340 








Tyr 


Leu 


Thr 


Leu 


Asp 


He Phe 


Ala 






355 








360 


Asp 


Glu 


Tyr 












370 













Trp Gin 


Pro 


Leu Phe Phe Lys 


Trp 


10 




Id 




Ser Gin 


He 


Ala Ala Ala Ala 


Ser 


25 








Thr Gin Arg 


Lys Lys Ser Gin 


Glu 






45 




Pro Gly Arg 


Pro Arg Glu Leu 


Thr 






bU 




Ala Asn Arg 


Phe Val Pro Lys 


Ser 




75 




80 


Ala He 


Glu 


Ala Gly Phe His 


His 


90 




95 




Asn Glu 


Glu 


Gin Val Gly Leu 


Ala 


105 




110 




Ser Val Lys 


Arg Glu Asp He 


Phe 






IOC 

125 




Ser His Arg 


Pro Glu Leu Val 


Arg 






140 




Asn Leu 


Gin 


Leu Asp Tyr Val 


Asp 




155 




160 


Ser Val 


Lys 


Pro Gly Glu Glu 


Val 


170 




175 




He Leu 


Phe 


Asp Thr Val Asp 


Leu 


185 




190 




Lys Cys 


Lys 


Asp Ala Gly Leu 


Ala 






one 
205 




Asn His Arg 


Leu Leu Glu Met 


He 










Lys Pro 


Val 


Cys Asn Gin Val 


Glu 




235 




240 


Lys Leu 


Leu 


Asp Phe Cys Lys 


Ser 


250 




255 




Ser" Ala 


Leu 


Gly Ser His Arg 


Glu 


265 




270 




Pro Val 


Leu 


Leu Glu Asp Pro 


Val 






285 




Lys Arg 


Thr 


Pro Ala Leu He 


Ala 






300 




Val Val 


Val 


Leu Ala Lys Ser 


Tyr 




315 




320 


Val Gin 


Val 


Phe Glu Phe Gin 


Leu 


330 




335 




Asp Gly Leu 


Asn Arg Asn Val 


Arg 


345 




350 




Gly Pro 


Pro 


Asn Tyr Pro Phe 


Ser 



365 



<210> 434 
<211> 1119 
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<212> DNA 

<213> Homo sapiens 

<400> 434 

atgcagcatc accaccatca ccactggcag cccctcttct tcaagtggct cttgtcctgt 60 

tgccctggga gttctcaaat tgctgcagca gcctccaccc agcctgagga tgacatcaat 120 

acacagagga agaagagtca ggaaaagatg agagaagtta cagactctcc tgggcgaccc 180 

cgagagctta ccattcctca gacttcttca catggtgcta acagatttgt tcctaaaagt 240 

aaagctctag aggccgtcaa attggcaata gaagccgggt tccaccatat tgattctgca 300 

catgtttaca ataatgagga gcaggttgga ctggccatcc gaagcaagat tgcagatggc 360 

agtgtgaaga gagaagacat attctacact tcaaagcttt ggagcaattc ccatcgacca 420 

gagttggtcc gaccagcctt ggaaaggtca ctgaaaaatc ttcaattgga ctatgttgac 480 

ctctatctta ttcattttcc agtgtctgta aagccaggtg aggaagtgat cccaaaagat 540 

gaaaatggaa aaatactatt tgacacagtg gatctctgtg ccacatggga ggccatggag 600 

aagtgtaaag atgcaggatt ggccaagtcc atcggggtgt ccaacttcaa ccacaggctg 660 

ctggagatga tcctcaacaa gccagggctc aagtacaagc ctgtctgcaa ccaggtggaa 720 

tgtcatcctt acttcaacca gagaaaactg ctggatttct gcaagtcaaa agacattgtt 780 

ctggttgcct atagtgctct gggatcccat cgagaagaac catgggtgga cccgaactcc 840 

ccggtgctct tggaggaccc agtcctttgt gccttggcaa aaaagcacaa gcgaacccca 900 

gccctgattg ccctgcgcta ccagctgcag cgtggggttg tggtcctggc caagagctac . 960 

aatgagcagc gcatcagaca gaacgtgcag gtgtttgaat tccagttgac ttcagaggag 1020 

atgaaagcca tagatggcct aaacagaaat gtgcgatatt tgacccttga tatttttgct 1080 

ggccccccta attatccatt ttctgatgaa tattaatga 1119 

<210> 435 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 435 

ggatccgccg ccaccatgac atccattcga gctgta 36 

<210> 436 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 436 

gtcgactcag ctggaccaca gccgcag 27 

<210> 437 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 437 

ggatccgccg ccaccatgga ctcctggacc ttctgct 37 



<210> 438 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 438 

gtcgactcag aaatcctttc tcttgac 27 

<210> 439 
<211> 933 
<212> DNA 
<213> Homo sapiens 

<400> 439 

atggactcct ggaccttctg ctgtgtgtcc ctttgcatcc tggtagcaaa gcacacagat 60 

gctggagtta tccagtcacc ccggcacgag gtgacagaga tgggacaaga agtgactctg 120 

agatgtaaac caatttcagg acacgactac cttttctggt acagacagac catgatgcgg 180 

ggactggagt tgctcattta ctttaacaac aacgttccga tagatgattc agggatgccc 240 

gaggatcgat tctcagctaa gatgcctaat gcatcattct ccactctgaa gatccagccc 300 

tcagaaccca gggactcagc tgtgtacttc tgtgccagca gtttagttgg agcaaacact 360 

gaagctttct ttggacaagg caccagactc acagttgtag aggacctgaa caaggtgttc 420 

ccacccgagg tcgctgtgtt tgagccatca gaagcagaga tctcccacac ccaaaaggcc 480 

acactggtgt gcctggccac aggcttcttc cctgaccacg tggagctgag ctggtgggtg 540 

aatgggaagg aggtgcacag tggggtcagc acggacccgc agcccctcaa ggagcagccc 600 

gccctcaatg actccagata ctgcctgagc agccgcctga gggtctcggc caccttctgg 660 

cagaaccccc gcaaccactt ccgctgtcaa gtccagttct acgggctctc ggagaatgac 720 

gagtggaccc aggatagggc caaacccgtc acccagatcg tcagcgccga ggcctggggt 780 

agagcagact gtggctttac ctcggtgtcc taccagcaag gggtcctgtc tgccaccatc 840 

ctctatgaga tcctgctagg gaaggccacc ctgtatgctg tgctggtcag cgcccttgtg 900 

ttgatggcca tggtcaagag aaaggatttc tga 933 

<210> 440 
<211> 822 
<212> DNA 

<213> Homo sapiens 
<400> 440 

atgacatcca ttcgagctgt atttatattc ctgtggctgc agctggactt ggtgaatgga 60 

gagaatgtgg agcagcatcc ttcaaccctg agtgtccagg agggagacag cgctgttatc 120 

aagtgtactt attcagacag tgcctcaaac tacttccctt ggtataagca agaacttgga 180 

aaaagacctc agcttattat agacattcgt tcaaatgtgg gcgaaaagaa agaccaacga 240 

attgctgtta cattgaacaa gacagccaaa catttctccc tgcacatcac agagacccaa 300 

cctgaagact cggctgtcta cttctgtgca gcaagtatac tgaacaccgg taaccagttc 360 

tattttggga cagggacaag tttgacggtc attccaaata tccagaaccc tgaccctgcc 420 

gtgtaccagc tgagagactc taaatccagt gacaagtctg tctgcctatt caccgatttt 480 

gattctcaaa caaatgtgtc acaaagtaag gattctgatg tgtatatcac agacaaaact 540 

gtgctagaca tgaggtctat ggacttcaag agcaacagtg ctgtggcctg gagcaacaaa 600 

tctgactttg catgtgcaaa cgccttcaac aacagcatta ttccagaaga caccttcttc 660 

cccagcccag aaagttcctg tgatgtcaag ctggtcgaga aaagctttga aacagatacg 720 

aacctaaact ttcaaaacct gtcagtgatt gggttccgaa tcctcctcct gaaagtggcc 780 

gggtttaatc tgctcatgac gctgcggctg tggtccagct ga 822 



<210> 441 
<211> 2311 
<212> DNA 



V 
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<213> Homo sapiens 



<400> 441 

gatttaatcc tatgacaaac taagttggtt ctgtcttcac ctgttttggt gaggttgtgt 60 
aagagttggt gtttgctcag gaagagattt aagcatgctt gcttacccag actcagagaa 12 0 
gtctccctgt tctgtcctag ctatgttcct gtgttgtgtg cattcgtctt ttccagagca 180 
aaccgcccag agtagaagat ggattggggc acgctgcaga cgatcctggg gggtgtgaac 240 
aaacactcca ccagcattgg aaagatctgg ctcaccgtcc tcttcatttt tcgcattatg 300 
atcctcgttg tggctgcaaa ggaggtgtgg ggagatgagc aggccgactt tgtctgcaac 360 
accctgcagc caggctgcaa gaacgtgtgc tacgatcact acttccccat ctcccacatc 420 
cggctatggg ccctgcagct gatcttcgtg tccagcccag cgctcctagt ggccatgcac 480 
gtggcctacc ggagacatga gaagaagagg aagttcatca agggggagat aaagagtgaa 540 
tttaaggaca tcgaggagat caaaacccag aaggtccgca tcgaaggctc cctgtggtgg 600 
acctacacaa gcagcatctt cttccgggtc atcttcgaag ccgccttcat gtacgtcttc 660 
tatgtcatgt acgacggctt ctccatgcag cggctggtga agtgcaacgc ctggccttgt 720 
cccaacactg tggactgctt tgtgtcccgg cccacggaga agactgtctt cacagtgttc 780 
atgattgcag tgtctggaat ttgcatcctg ctgaatgtca ctgaattgtg ttatttgcta 84 0 
attagatatt gttctgggaa gtcaaaaaag ccagtttaac gcattgccca gttgttagat 900 
taagaaatag acagcatgag agggatgagg caacccgtgc tcagctgtca aggctcagtc 960 
gccagcattt cccaacacaa agattctgac cttaaatgca accatttgaa acccctgtag 1020 
gcctcaggtg aaactccaga tgccacaatg agctctgctc ccctaaagcc tcaaaacaaa 1080 
ggcctaattc tatgcctgtc ttaattttct ttcacttaag ttagttccac tgagacccca 1140 
ggctgttagg ggttattggt gtaaggtact ttcatatttt aaacagagga tatcggcatt 1200 
tgtttctttc tctgaggaca agagaaaaaa gccaggttcc acagaggaca cagagaaggt 1260 
ttgggtgtcc tcctggggtt ctttttgcca actttcccca cgttaaaggt gaacattggt 1320 
tctttcattt gctttggaag ttttaatctc taacagtgga caaagttacc agtgccttaa 1380 
actctgttac actttttgga agtgaaaact ttgtagtatg ataggttatt ttgatgtaaa 14 40 
gatgttctgg ataccattat atgttccccc tgtttcagag gctcagattg taatatgtaa 1500 
atggtatgtc attcgctact atgatttaat ttgaaatatg gtettttggt tatgaatact 1560 
ttgcagcaca gctgagagag gctgtctgtt gtattcattg tggtcatagc acctaacaac 1620 
attgtagcct caatcgagtg agacagacta gaagttccta gttggcttat gatagcaaat 1680 
ggcctcatgt caaatattag atgtaatttt gtgtaagaaa tacagactgg atgtaccacc 17 40 
aactactacc tgtaatgaca ggcctgtcca acacatctcc cttttccatg ctgtggtagc 18 00 
cagcatcgga aagaacgctg atttaaagag gtgagcttgg gaattttatt gacacagtac 1860 
catttaatgg ggagacaaaa atgggggcca ggggagggag aagtttctgt cgttaaaaac 1920 
gagtttggaa agactggact ctaaattctg ttgattaaag atgagctttg tctaccttca 1980 
aaagtttgtt tggcttaccc ccttcagcct ccaatttttt aagtgaaaat ataactaata 2040 
acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct ataggaagat 2100 
tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac tccacatact 2160 
tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca ttttgtaata 2220 
ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg tatcaaatac 2280 
atttaaaaca ttaaaatata atctctataa t 2311 

<210> 442 

<211> 226 

<212> PRT 

<213> Homo sapiens 

<400> 442 

Met Asp Trp Gly Thr Leu Gin Thr He Leu Gly Gly Val Asn Lys His 
5 10 15 

Ser Thr Ser He Gly Lys He Trp Leu Thr Val Leu Phe He Phe Arg 
20 25 30 



He Met He Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gin 
35 40 45 
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Ala Asp Phe Val Cys Asn Thr Leu Gin Pro Gly Cys Lys Asn Val Cys 
50 55 60 

Tyr Asp His Tyr Phe Pro He Ser His He Arg Leu Trp Ala Leu Gin 
65 70 75 80 

Leu He Phe Val Ser Ser Pro Ala Leu Leu Val Ala Met His Val Ala 
85 90 95 

Tyr Arg Arg His Glu Lys Lys Arg Lys Phe He Lys Gly Glu He Lys 
100 105 110 

Ser Glu Phe Lys Asp He Glu Glu He Lys Thr Gin Lys Val Arg He 
115 120 125 

Glu Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser He Phe Phe Arg Val 
130 * 135 140 

He Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Val Met Tyr Asp Gly 
145 150 155 160 

Phe Ser Met Gin Arg Leu Val Lys Cys Asn Ala Trp Pro Cys Pro Asn 
165 170 175 

Thr Val Asp Cys Phe Val Ser Arg Pro Thr Glu Lys Thr Val Phe Thr 
180 185 190 

Val Phe Met lie Ala Val Ser Gly He Cys He Leu Leu Asn Val Thr 
195 200 205 

Glu Leu Cys Tyr Leu Leu He Arg Tyr Cys Ser Gly Lys Ser Lys Lys 
210 215 220 

Pro Val , 

225 



<210> 443 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 443 

Val Lys Leu Cys Gly He Asp Pro Cys Pro Asn Leu Val Asp Cys Phe 
5 10 15 

He Ser Arg Pro Gly Cys Gly 
20 



<210> 444 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR primer 
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30 



<400> 444 

caatcaggca tgcacaacaa actgtatatc ggaaac 36 

<210> 445 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 445 

cgtcaagatc ttcattactt ccgtcttgac 

<210> 446 
<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 446 

Met Asn Lys Leu Tyr lie Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 
5 10 15 

Asp Leu Glu Ser lie Phe Lys Asp Ala Lys lie Pro Val Ser Gly Pro 
20 25 30 

Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser 
35 40 45 

Trp Ala Leu Lys Ala lie Glu Ala Leu Ser Gly Lys lie Glu Leu His 
50 " 55 60 

9 

Gly Lys Pro He Glu Val Glu His Ser Val Pro Lys Arg Gin Arg He 
65 70 75 80 

Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Trp Glu Val 
85 90 95 

Leu Asp Ser Leu Leu Val Gin Tyr Gly Val Val Glu Ser Cys Glu Gin 
100 105 HO 

Val Asn Thr Asp Ser Glu Thr Ala Val Val Asn Val Thr Tyr Ser Ser 
115 120 125 

Lys Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu 
130 135 140 

Glu Asn Phe Thr Leu Lys Val Ala Tyr He Pro Asp Glu Thr Ala Ala 
145 150 155 160 

Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly Gin 
165 170 175 

Arg Gly Ser Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys 
180 185 190 
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Pro 


Cys 


Asp Leu 
195 


Pro 


Leu 


Arg 


Leu 
200 


Leu 


Val 


Pro 


Thr 


Gin 
205 


Phe 


Val 


Gly 


Ala 


He 
210 


He Gly 


Lys 


Glu 


Gly 
215 


Ala 


Thr 


He Arg 


Asn 
220 


He 


Thr 


Lys 


Gin 


Thr 

225 
Glu 


Gin 
Lys 


Ser Lys 
Ser He 


He 

Thr 
245 


Asp 
230 
He 


Val 
Leu 


His 
Ser 


Arg 
Thr 


Lys 

Pro 
250 


Glu 
235 
Glu 


Asn 
Gly 


Ala 
Thr 


Gly 
Ser 


Ala 

Ala 
255 


Ala 
240 
Ala 


Cys 


Lys 


Ser He 
260 


Leu 


Glu 


He 


Met 


His 
265 


Lys Glu 


Ala 


Gin 


Asp 
270 


He 


Lys 


Phe 


Thr 


Glu Glu 
275 


He 


Pro 


Leu 


Lys 
280 


He 


Leu 


Ala 


His 


Asn 
285 


Asn 


Phe 


Val 


Gly 


Arg 
290 


Leu He 


Gly 


Lys 


Glu 
295 


Gly 


Arg 


Asn 


Leu 


Lys 
300 


Lys 


He 


Glu 


Gin 


Asp 
305 


Thr 


Asp Thr 


Lys' 


He 
310 


Thr 


He 


Ser 


Pro 


Leu 

jIj 


Gin 


Glu 


Leu 


Thr 


Leu 
320 


Tyr 


Asn 


Pro Glu 


Arg 
325 


Thr 


He 


Thr 


Val 


Lys 
330 


Gly 


Asn 


Val 


Glu 


Thr 
335 


Cys 


Ala 


Lys 


Ala Glu 
340 


Glu 


Glu 


He 


Met 


Lys 
345 


Lys 


He 


Arg 


Glu 


Ser 
350 


Tyr 


Glu 


Asn 


Asp 


He Ala 
355 


Ser 


Met 


Asn 


Leu 
360 


Gin 


Ala 


His 


Leu 


He 
365 


Pro 


Gly 


Leu 


Asn 


Leu 
370 


Asn Ala 


Leu 


Gly 


Leu 
375 


Phe 


Pro 


Pro 


Thr 


Ser 
380 


Gly 


Met 


Pro 


Pro 


Pro 
385 


Thr 


Ser Gly 


Pro 


Pro 
390 


Ser 


Ala 


Met 


Thr 


Pro 
395 


Pro 


Tyr 


Pro 


Gin 


Phe 
400 


Glu 


Gin 


Ser Glu 


Thr 
405 


Glu 


Thr 


Val 


His 


Leu 
410 


Phe 


He 


Pro 


Ala 


Leu 
415 


Ser 


Val 


Gly 


Ala He 
420 


He 


Gly 


Lys 


Gin 


Gly 
425 


Gin 


His 


He 


Lys 


Gin 
430 


Leu 


Ser 


Arg 


Phe 


Ala Gly 
435 


Ala 


Ser 


He 


Lys 
440 


He 


Ala 


Pro 


Ala 


Glu 
445 


Ala 


Pro 


Asp 


Ala 


Lys 
450 


Val Arg 


Met 


Val 


He 
455 


He 


Thr 


Gly Pro 


Pro 
460 


Glu 


Ala 


Gin 


Phe 


Lys 
465 


Ala 


Gin Gly 


Arg 


He 
470 


Tyr 


Gly 


Lys 


He 


Lys 
475 


Glu 


Glu 


Asn 


Phe 


Val 
480 


Ser 


Pro 


Lys Glu 


Glu 
485 


Val 


Lys 


Leu 


Glu 


Ala 
490 


His 


He 


Arg 


Val 


Pro 
495 


Ser 


Phe 


Ala 


Ala Gly 
500 


Arg 


Val 


He 


Gly 


Lys 
505 


Gly Gly 


Lys 


Thr 


Val 
510 


Asn 


Glu 
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Leu Gin Asn Leu Ser Ser Ala Glu 
515 520 

Pro Asp Glu Asn Asp Gin Val Val 
530 • 535 

Ala Cys Gin Val Ala Gin Arg Lys 
545 550 

Lys Gin His Gin Gin Gin Lys Ala 
565 



Val Val Val Pro Arg Asp Gin Thr 
525 

Val Lys He Thr Gly His Phe Tyr 
540 

lie Gin Glu He Leu Thr Gin Val 
555 560 

Leu Gin Ser Gly Pro Pro Gin Ser 
570 575 



Arg Arg Lys 



<210> 447 

<211> 1743 

<212> DNA 

<213> Homo sapiens 

<400> 447 

atgaacaaac tgtatatcgg aaacctcagc 
atcttcaagg acgccaagat cccggtgtcg 
ttcgtggact gcccggacga gagctgggcc 
atagaactgc acgggaaacc catagaagtt 
cggaaacttc agatacgaaa tatcccgcct 
ctagtccagt atggagtggt ggagagctgt 
gttgtaaatg taacctattc cagtaaggac 
ggatttcagt tagagaattt caccttgaaa 
cagcaaaacc ccttgcagca gccccgaggt 
aggcaggggt ctccaggatc cgtatccaag 
ctggttccca cccaatttgt tggagccatc 
atcaccaaac agacccagtc taaaatcgat 
gagaagtcga ttactatcct ctctactcct 
ctggagatta tgcataagga agctcaagat 
attttagctc ataataactt tgttggacgt 
aaaattgagc aagacacaga cactaaaatc 
tataatccag aacgcactat tacagttaaa 
gaggagatca tgaagaaaat cagggagtct 
caagcacatt taattcctgg attaaatctg 
gggatgccac ctcccacctc agggccccct 
gagcaatcag aaacggagac tgttcatctg 
atcggcaagc agggccagca catcaagcag 
attgctccag cggaagcacc agatgctaaa 
gaggctcagt tcaaggctca gggaagaatt 
agtcctaaag aagaggtgaa acttgaagct 
agagttattg gaaaaggagg caaaacggtg 
gttgttgtcc ctcgtgacca gacacctgat 
ggtcacttct atgcttgcca ggttgcccag 
aagcagcacc aacaacagaa ggctctgcaa 
tga 



gagaacgccg ccccctcgga cctagaaagt 60 
ggacccttcc tggtgaagac tggctacgcg 120 
ctcaaggcca tcgaggcgct ttcaggtaaa 180 
gagcactcgg tcccaaaaag gcaaaggatt 240 
catttacagt gggaggtgct ggatagttta 300 
gagcaagtga acactgactc ggaaactgca 360 
caagctagac aagcactaga caaactgaat 420 
gtagcctata tccctgatga aacggccgcc 480 
cgccgggggc ttgggcagag gggctcctca 540 
cagaaaccat gtgatttgcc tctgcgcctg 600 
ataggaaaag aaggtgccac cattcggaac 660 
gtccaccgta aagaaaatgc gggggctgct 720 
gaaggcacct ctgcggcttg taagtctatt 780 
ataaaattca cagaagagat ccccttgaag 84 0 
cttattggta aagaaggaag aaatcttaaa 900 
acgatatctc cattgcagga attgacgctg 960 
ggcaatgttg agacatgtgc caaagctgag 1020 
tatgaaaatg atattgcttc tatgaatctt 1080 
aacgccttgg gtctgttccc acccacttca 1140 
tcagccatga ctcctcccta cccgcagttt 1200 
tttatcccag ctctatcagt cggtgccatc 12 60 
ctttctcgct ttgctggagc ttcaattaag 1320 
gtgaggatgg tgattatcac tggaccacca 1380 
tatggaaaaa ttaaagaaga aaactttgtt 14 4 0 
catatcagag tgccatcctt tgctgctggc 1500 
aatgaacttc agaatttgtc aagtgcagaa 1560 
gagaatgacc aagtggttgt caaaataact 1620 
agaaaaattc aggaaattct gactcaggta 1680 
agtggaccac ctcagtcaag acggaagtaa 1740 

1743 



<210> 448 
<211> 35 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 448 

cgtactagca tatgaacaaa ctgtatatcg gaaac 35 

<210> 449 
<211> 579 
<212> PRT 
<213> Homo sapiens 

<400> 449 



Met 


Asn 


Lys 


Leu 


Tyr 
5 


He 


Gly 


Asn 


Leu 


Ser 
10 


Glu Asn Ala Ala Pro Ser 
15 


Asp 


Leu 


Glu 


Ser 
20 


He 


Phe 


Lys 


Asp 


Ala 
25 


Lys 


He Pro Val Ser Gly Pro 
30 


Phe 


Leu 


Val 
35 


Lys 


Thr 


Gly 


Tyr 


Ala 
40 


Phe 


Val Asp Cys Pro Asp Glu Ser 
45 


Trp 


Ala 
50 


Leu 


Lys 


Ala 


He 


Glu 
55 


Ala 


Leu 


Ser 


Gly Lys He Glu Leu His 
60 


Gly 
65 


Lys 


Pro 


He 


Glu 


Val 

70 


Glu 


His 


Ser 


Val 


Pro Lys Arg Gin Arg He 
75 ' 80 


Arg 


Lys 


Leu 


Gin 


He 
85 


Arg 


Asn 


He 


Pro 


Pro 
90 


His Leu Gin Trp Glu Val 
95 


Leu 


Asp 


Ser 


Leu 
100 


Leu 


Val 


Gin 


Tyr 


Gly 
105 


Val 


Val Glu Ser Cys Glu Gin 
110 


Val 


Asn 


Thr 
115 


Asp 


Ser 


Glu 


Thr 


Ala 
120 


Val 


Val 


Asn Val Thr Tyr Ser Ser 
125 


Lys 


Asp 
130 


Gin 


Ala 


Arg 


Gin 


Ala 

135 


Leu 


Asp 


Lys 


Leu Asn Gly Phe Gin Leu 
140 


Glu 
145 


Asn 


Phe 


Thr 


Leu 


Lys 
150 


Val 


Ala 


Tyr 


He 


Pro Asp Glu Thr Ala Ala 
155 160 


Gin 


Gin 


Asn 


Pro 


Leu 
165 


Gin 


Gin 


Pro 


Arg 


Gly Arg Arg Gly Leu Gly Gin 
170 175 


Arg 


Gly 


Ser 


Ser 
180 


Arg 


Gin 


Gly 


Ser 


Pro 
185 


Gly Ser Val Ser Lys Gin Lys 
190 


Pro 


Cys 


Asp 
195 


Leu 


Pro 


Leu 


Arg 


Leu 
200 


Leu 


Val 


Pro Thr Gin Phe Val Gly 
205 


Ala 

Thr 
225 


He 
210 
Gin 


He 
Ser 


Gly 
Lys 


Lys 
He 


Glu 

Asp 
230 


Gly 
215 
Val 


Ala 

His 


Thr 
Arg 


He Arg Asn He Thr Lys Gin 
220 

Lys Glu Asn Ala Gly Ala' Ala 
235 240 
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Glu Lys Ser lie Thr lie Leu Ser Thr Pro Glu Gly Thr Ser Ala Ala 
245 250 255 

Cys Lys Ser lie Leu Glu lie Met His Lys Glu Ala Gin Asp lie Lys 
260 265 270 

Phe Thr Glu Glu lie Pro Leu Lys lie Leu Ala His Asn Asn Phe Val 
275 280 285 

Gly Arg Leu lie Gly Lys Glu Gly Arg Asn Leu Lys Lys lie Glu Gin 
290 295 300 

Asp Thr Asp Thr Lys lie Thr lie Ser Pro Leu Gin Glu Leu Thr Leu 
305 310 315 320 

Tyr Asn Pro Glu Arg Thr lie Thr Val Lys Gly Asn Val Glu Thr Cys 
325 330 335 

Ala Lys Ala Glu Glu Glu lie Met Lys Lys lie Arg Glu Ser Tyr Glu 
340 345 350 

Asn Asp lie Ala Ser Met Asn Leu Gin Ala His Leu lie Pro Gly Leu 
355 360 365 

Asn Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro 
370 375 380 

Pro Thr Ser Gly Pro Pro Ser Ala Met Thr Pro Pro Tyr Pro Gin Phe 
385 390 395 400 

Glu Gin Ser Glu Thr Glu Thr Val His Leu Phe He Pro Ala Leu Ser 
405 410 415 

Val Gly Ala He He Gly Lys Gin Gly Gin His He Lys Gin Leu Ser 
420 425 430 

Arg Phe Ala Gly Ala Ser He Lys He Ala Pro Ala Glu Ala Pro Asp 
435 440 445 

Ala Lys Val Arg Met Val He He Thr Gly Pro Pro Glu Ala Gin Phe 

450 455 460 

Lys Ala Gin Gly Arg lie Tyr Gly Lys He Lys Glu Glu Asn Phe Val 

465 470 475 480 

Ser Pro Lys Glu Glu Val Lys Leu Glu Ala His He Arg Val Pro Ser 
485 490 495 

Phe Ala Ala Gly Arg Val He Gly Lys Gly Gly Lys Thr Val Asn Glu 
500 505 510 

Leu Gin Asn Leu Ser Ser Ala Glu Val Val Val Pro Arg Asp Gin Thr 
515 520 525 

Pro Asp Glu Asn Asp Gin Val Val Val Lys He Thr Gly His Phe Tyr 
530 535 540 



Ala Cys Gin Val Ala Gin Arg Lys He Gin Glu lie Leu Thr Gin Val 
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545 550 555 560 

Lys Gin His Gin Gin Gin Lys Ala Leu Gin Ser Gly Pro Pro Gin Ser 
565 570 575 

Arg Arg Lys 



<210> 450 

<211> 1743 

<212> DNA 

<213> Homo sapiens 

<400> 450 

atgaacaaac tgtatatcgg aaacctcagc gagaacgccg ccccctcgga cctagaaagt 60 
atcttcaagg acgccaagat cccggtgtcg ggacccttcc tggtgaagac tggctacgcg 120 
ttcgtggact gcccggacga gagctgggcc ctcaaggcca tcgaggcgct ttcaggtaaa 180 
atagaactgc acgggaaacc catagaagtt gagcactcgg tcccaaaaag gcaaaggatt 240 
cggaaacttc agatacgaaa/ tatcccgcct catttacagt gggaggtgct ggatagttta 300 
ctagtccagt atggagtggt ggagagctgt gagcaagtga acactgactc ggaaactgca 360 
gttgtaaatg taacctattc cagtaaggac caagctagac aagcactaga caaactgaat 420 
ggatttcagt tagagaattt caccttgaaa gtagcctata tccctgatga aacggccgcc 480 
cagcaaaacc ccttgcagca gccccgaggt cgccgggggc ttgggcagag gggctcctca 540 
aggcaggggt ctccaggatc cgtatccaag cagaaaccat gtgatttgcc tctgcgcctg 600 
ctggttccca cccaatttgt tggagccatc ataggaaaag aaggtgccac cattcggaac 660 
atcaccaaac agacccagtc taaaatcgat gtccaccgta aagaaaatgc gggggctgct 720 
gagaagtcga ttactatcct ctctactcct gaaggcacct ctgcggcttg taagtctatt 780 
ctggagatta tgcataagga agctcaagat ataaaattca cagaagagat ccccttgaag 840 
attttagctc ataataactt tgttggacgt cttattggta aagaaggaag aaatcttaaa 900 
aaaattgagc aagacacaga cactaaaatc acgatatctc cattgcagga attgacgctg 960 
tataatccag aacgcactat tacagttaaa ggcaatgttg agacatgtgc caaagctgag 1020 
gaggagatca tgaagaaaat cagggagtct tatgaaaatg atattgcttc tatgaatctt 1080 
caagcacatt taattcctgg attaaatctg aacgccttgg gtctgttccc acccacttca 1140 
gggatgccac ctcccacctc agggccccct tcagccatga ctcctcccta cccgcagttt 1200 
gagcaatcag aaacggagac tgttcatctg tttatcccag ctctatcagt cggtgccatc 1260 
atcggcaagc agggccagca catcaagcag ctttctcgct ttgctggagc ttcaattaag 1320 
attgctccag cggaagcacc agatgctaaa gtgaggatgg tgattatcac tggaccacca 1380 
gaggctcagt tcaaggctca gggaagaatt tatggaaaaa ttaaagaaga aaactttgtt 1440 
agtcctaaag aagaggtgaa acttgaagct catatcagag tgccatcctt tgctgctggc 1500 
agagttattg gaaaaggagg caaaacggtg aatgaacttc agaatttgtc aagtgcagaa 1560 
gttgttgtcc ctcgtgacca gacacctgat gagaatgacc aagtggttgt caaaataact 1620 
ggtcacttct atgcttgcca ggttgcccag agaaaaattc aggaaattct gactcaggta 1680 
aagcagcacc aacaacagaa ggctctgcaa agtggaccac ctcagtcaag acggaagtaa 1740 
tga ^ i743 



<210> 451 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 451 

Leu Gly Lys Glu Val Arg Asp Ala Lys He Thr Pro Glu Ala Phe Glu 
5 10 15 

Lys Leu Gly Phe Pro Ala Ala Lys Glu 
20 25 
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<210> 452 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 452 

Lys Ala Ser Asp Gly Asp Tyr Tyr Thr Leu Ala Val Pro Met Gly Asp 
5 10 15 

Val Pro Met Asp Gly He Ser Val Ala 
20 25 



<210> 453 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 453 

Pro Asp Arg Asp Val Asn Leu Thr His Gin Leu Asn Pro Lys Val Lys 
5 10 15 



<210> 454 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 454 

Lys He Ala Pro Ala Glu Ala Pro Asp Ala Lys Val Arg Met Val He 
5 J 10 ' ■ " 15 



He Thr Gly Pro 
20 



<210> 455 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 455 

Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly 
5 10 15 

Arg Arg Gly Leu 
20 



<210> 456 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 456 

Arg Thr He Thr Val Lys Gly Asn Val Glu Thr Cys Ala Lys Ala Glu 
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5 10 15 

Glu Glu lie Met 
20 

<210> 457 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 457 

Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys Ala lie Glu 
5 10 15 

Ala Leu Ser Gly 
20 



<210> 458 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 458 

lie Arg Lys Leu Gin lie Arg Asn lie Pro Pro His Leu Gin Trp Glu 
5 10 15 

Val Leu Asp Ser 
20 



<210> 459 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 459 

Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly 
5 10 15 

Gin Arg Gly Ser 
20 



<210> 460 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 460 

Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala Glu Lys Ser He Thr 
5 10 15 

He Leu Ser Thr 
20 



<210> 461 
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<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 461 

Leu Tyr Asn Pro Glu Arg Thr lie Thr Val Lys Gly Asn Val Glu Thr 
5 10 " 15 

Cys Ala Lys Ala 
20 



<210> 462 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 462 

Glu Glu Glu He Met Lys Lys He Arg Glu Ser Tyr Glu Asn Asp He 
5 10 15 

Ala Ser Met Asn 
20 



<210> 463 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 463 

Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro Pro 
5 10 15 

Thr Ser Gly Pro 
20 



<210> 464 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 464 

Lys He Ala Pro Ala Glu Ala Pro Asp Ala Lys Val Arg Met Val He 
5 10 15 

He Thr Gly Pro 
20 

<210> 465 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 465 

Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys He 
5 10 15 
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Glu 



<210> 466 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 466 

Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu 
5 10 

<210> 467 

<211> 33 

<212> DNA 

<213> Homo sapiens 



<400> 467 

ttcgtggact gcccggacga gagctgggcc etc 



33 



<210> 468 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 468 

lie Pro Asp Glu Met Ala Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg 
5 10 15 

Gly Arg Arg Gly Leu Gly Gin Arg 
20 



<210> 469 
<211> 24 
<212> PRT 

<213> Homo sapiens 



<400> 469 

lie Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Ser Pro Gin Leu Arg 
5 10 15 



Gly Arg Arg Gly Pro Gly Gin Arg 
20 



\ 



> 



